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PEEFACE 


rpHE  composition  of  this  book  requires  some  explanation, 
Chapters  I,  II,  and  III  of  Part  II  originally  formed  a 
doctorate  thesis  *  on  "  The  Use  of  the  Theory  of  Correlation  in 
Psychology,"  and  were  privately  printed  as  such  in  August  of  last 
year.  The  other  chapters  of  the  book  have,  as  it  were,  been 
written  round  this  nucleus,  with  a  view  to  producing  a  concise 
yet  fairly  complete  account  of  the  various  quantitative  methods 
employed  in  psychology.  The  slightly  controversial  nature 
of  the  thesis  is  intentionally  retained,  since  the  interests  of 
quantitative  psychology  would  seem  to  be  best  served  thus — 
at  least  until  the  biometric  methods  of  Professor  Karl  Pearson 
and  his  school  become  more  generall}^  known  among  psycho- 
logists than  they  are  at  present.  Perhaps  it  is  as  well  to  add 
that  I  am  alone  responsible  for  the  views  therein  expressed. 

Although  written  primarily  for  the  professed  psychologist,  it 
is  hoped  that  the  book  will  also  prove  of  use  to  the  educationist. 
The  theory  of  correlation  is  likely  to  be  found  especially  valuable 
in  educational  psychology,  if  only  its  presuppositions  are  clearly 
grasped  and  the  requisite  precautions  in  its  application  exercised ; 
and  now  that  a  real  training  in  psychology  is  beginning  to  be 
regarded  as  an  essential  part  of  a  teacher's  equipment,  we  may 

*  Approved  for  the  Degree  of  Doctor  of  Science  in  the  University  of  London. 
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expect  valuable  contributions  to  psychology  froni  teachers  in 
the  immediate  future.  If  this  book  is  found  helpful  on  the 
quantitative  side  to  such  research  students,  its  existence  will 
be  justified. 

My  thanks  are  due  to  the  Editors  of  the  British  Journal  of 
Psychology,  who  have  very  kindly  allowed  me  to  re-publish  an 
article  from  that  periodical. 

W.  B. 


Mai/,  1911. 
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PART  I 

PSYCHO-PHYSICS 


CHAPTER  I 

MENTAL  MEASUREMENT 

The  pre-conditions  of  measurement  in  any  sphere  of  ex- 
perience are  (1)  the  homogeneity  of  the  phenomenon,  or  of  any 
particular  aspect  of  it,  to  be  measured,  (2)  the  possibility  of 
fixing  a  unit  in  terms  of  which  the  measurement  may  be 
made,  and  of  which  the  total  magnitude  may  be  regarded  as 
a  mere  multiple  or  sub-multiple.  These  pre-requisites  are 
satisfied  in  the  cases  of  spatial  and  temporal  magnitudes,  in 
terms  of  which,  directly  or  indirectly,  all  the  measurements  of 
the  physical  sciences  are  expressed.  It  was  thought  by 
Fechner  that  they  also  applied  in  the  case  of  the  strictly 
psychical  phenomenon  of  sensation-intensity,  i.e.  it  was  assumed 
that  any  given  sensation-intensity  might  be  regarded  as  made 
up  of  a  sum  of  unit  sensation-intensities.  This  view  has  been 
definitely  rejected  by  later  psychologists.  Every  sensation- 
intensity  is  qualitatively  distinct  from  every  other  sensation- 
intensity.  No  addition  or  subtraction  of  intensities  is  possible. 
"  To  introspection,  our  feeling  of  pink  is  surely  not  a  portion  of 
our  feeling  of  scarlet ;  nor  does  the  light  of  an  electric  arc  seem 
to  contain  that  of  a  tallow-candle  in  itself"  (James)*.  Fechner's 
mistake  was  due  to  a  confusion  of  sensation-intensities  with 
the  (physical)  stimulus-values  required  to  produce  them.  The 
latter  can  be  measured,  the  former  cannot. 

Nevertheless,  purely  psychical  measurement  is  not  entirely 
impossible.    Within  any  one  series  of  sensation-intensities, 

*  Principles  of  Psychology,  Vol.  i.  p.  546. 
B.  1 
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e.g.  a  series  of  greys,  the  contrasts  or  "distances"  separating 
different  pairs  of  intensities  are  perfectly  homogeneous  with 
one  another  and  can  be  measured  in  terms  of  one  another  or  in 
terms  of  an  arbitrarily  chosen  unit  of  "  sense-distance."  Given 
two  brightness-intensities  a  and  h,  it  is  quite  possible  to  find, 
within  limits  of  error,  a  brightness-intensity  c  which  is  as 
much  higher  than  h  in  the  scale  of  intensities  as  h  is  than  a, 
I.e.  such  that  the  sense-distance  he  =  the  sense- distance  ab ;  or, 
again,  it  is  quite  possible,  theoretically,  to  find  a  brightness- 
intensity  d  which  bisects  the  sense-distance  ab,  i.e.  which  is 
such  that  it  is  as  far  removed  from  a  in  the  scale  of  intensities 
as  b  is  fi-om  it— in  symbols,  ad  =  db.    Hence  the  "  distance,"  or 


a 

Scale  of 


brightness-intensities. 

Fig.  1. 

disparity  of  b  from  a  is  twice  that  of  d  from  a,  the  distance  of  c 
from  a  is  four  times  that  of  d  from  a.  If,  now,  ad,  or  the 
distance  of  d  from  a,  be  taken  as  a  conventional  unit,  the 
values  of  ab  and  ac  will  be  2  and  4  respectively*. 

A  scale  of  intensities  may  in  this  way  be  formed  rising  by 
"  equal-appearing  intervals  "  or  sense- distances,  and  the  magni- 
tude of  any  given  interval,  may  then,  theoretically,  be  read  off 
in  terms  of  the  unit-distance  employed  in  the  construction  of 
the  scale.    In  practice,  however,  it  is  found  more  convenient  to 
fix  the  successive  scale-marks,  the  successive  members  of  the 
intensity-series,  in  terms  of  their  corresponding  stimulus-values. 
It  has  been  found  by  experiment  that  in  the  case  of  light- 
intensities  and  sound-intensities  the  successive  stimulus-values 
form,  with  fair  approximation,  a  geometric  progression,  or,  in 
other  words,  each  stimulus  value  divided  by  the  immediately 
preceding  one  gives  approximately  the  same  quotient.  From 
the  stimulus-values  corresponding  to  an  ascending  series  of 

*  This  view  of  mental  measurement  in  terms  of  sense-distance  first  originated 
with  J.  E.  L.  Delboeuf:  Revue  Philosophique  v.  1878,  p.  53.  His  term  for 
sense-distance  was  "contraste  sensible." 
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eight  equidistant  brightness-values,  Ebbinghaus  obtained  the 
following  series  of  quotients : 

2-3  2-1  2-1  1-8  1-7  1-7  20 
The  quotient-value  is  not  entirely  constant,  being  slightly 
greater  towards  the  two  ends  of  the  scale  than  it  is  at  or  about 
the  middle.  For  this  central  region  of  the  scale,  then,  the 
general  relation  of  the  sense-distances  to  the  stimulus- values  is 
given  by  the  logarithmic  formula : 

SS,  =  klog^^  (1)*, 

where  SS^  represents  a  sense-distance,  and  is  any  finite 
intensity  of  sensation  taken  as  the  starting  point  or  conven- 
tional zero  (N.B.  it  is  not  necessarily  liminal);  R,  are 
corresponding  stimulus- values. 

A  mode  of  procedure  which  not  only  admits  of  much  wider 
practical  application  than  the  above-mentioned  "method  of 
mean  gradations,"  but  also  possesses  a  peculiar  historical 
importance,  is  that  which  is  concerned  with  the  determination 
of  the  stimulus-increments  corresponding  to  just-noticeable 
increments  of  sensation-intensity  in  different  parts  of  the 
intensity  scale.  Weber  found,  in  a  series  of  experiments  chiefly 
with  lifted  weights,  that  this  stimulus-increment  was  relatively, 
not  absolutely,  constant  for  different  regions  of  the  intensity 
scale,  i.e.  that  the  stimulus  corresponding  to  any  original 
sensation-intensity  had  always  to  be  increased  by  a  constant 
proportion  to  arouse  a  just-noticeable  increment  of  the  sensa- 
tion-intensity. If  a  103  grms.  wt.  is  just  noticeably  heavier 
than  a  100  grms.  wt.,  then  the  wt.  just  noticeably  heavier  than 
a  200  grms.  wt.  will  be  a  206  grms.  wt.,  that  just  noticeably 
heavier  than  300  grms.  will  be  309  grms.,  etc. 

Mathematically  formulated,  Weber's  Law  is  : 
dR 

-g- =  a  constant   (2). 

dR 

The  quantity  _  ^  or  '03  in  our  example  of  weight- 
lifting— is  known  as  the  "relative  difference  limen"  (DL). 

*  Ebbinghaus. 

1—2 
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It  is  of  course  the  average  value  of  a  considerable  number  of 
determinations. 

Fechner  verified  Weber's  Law  in  many  different  realms  of 
sensation-intensity,  and  made  it  the  basis  of  his  own  system  of 
mental  measurement.  This  he  did  by  making  the  following 
three  assumptions : 

(1)  that  a  sensation-intensity  is  a  measurable  magnitude 
and  may  therefore  be  regarded  as  a  sum  of  unit-intensities; 

(2)  that  just-noticeable  differences  of  sensation -intensity 
are  equal  at  different  parts  of  the  stimulus  scale,  and  may 
therefore  conveniently  serve  as  the  unit-intensities  above- 
mentioned  ; 

(3)  that  the  just-noticeable  difference  of  sensation  may  be 
treated  as  a  difference  of  two  sensations,  or  at  least  that  if 
Weber's  Law  applies  to  the  former  ("  sensed  difference  ")  it  will 
also  apply  to  the  latter  ("  difference  sensation  "). 

On  the  basis  of  Weber's  Law  and  these  added  assumptions, 
Fechner  obtains  the  following  formula,  viz. 

dS=c^  (3) 

which  he  calls  the  fundamental  formula  for  mental  measure- 
ment.   Integrating,  this  becomes 

S=c\ogeB+C. 
Putting  R  =  r,  the  "stimulus  limen"  for  which  S  is  just  below 
the  threshold  of  consciousness,  i.e.  =  0,  and  we  have 

0  =  c  loge  r+C. 
It 

Subtracting,  ^  =  c  log^  — . 

Putting  r  =  1,  and  transferring  to  the  ordinary  logarithm  system, 
we  get 

8  =  k\ogR   (4). 

All  the  assumptions  involved  are  questionable.  The  first 
one  has  already  been  considered  at  some  length.  It  is  not  the 
single  sensation-intensity  which  is  measurable,  but  the  distmct- 
ness,  disparity  or  distance  of  one  sensation-intensity  from 
another.  Any  intensity  taken  by  itself  is  0,  it  is  at  a  zero 
distance  from  itself,  ^  =  0  [see  equation  (1)  above]. 
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We  must  therefore  regard  the  just-noticeable  difference, 
not  as  a  difference  of  two  sensation-intensities  but  as  a  minimal 
sense-distance,  if  we  are  to  be  able  to  make  use  of  it  in  our 
scheme  of  mental  measurement.  This  modification,  however, 
still  leaves  us  involved  in  the  difficulties  of  assumptions 
(2)  and  (3). 

Fechner's  own  reason  for  regarding  all  just-noticeable 
diflFerences  belonging  to  any  one  scale  of  intensities  as  equal 
was  that  they  appear  equal  to  introspection.  Introspection  in 
a  case  like  this  is  obviously  difficult,  even  for  the  most  skilful 
observers,  and  its  verdict  cannot,  therefore,  be  greatly  relied 
upon.  Theoretically  it  is  quite  conceivable  that  just-noticeable 
differences,  though  equivalent  to  one  another  as  being  all  just 
noticeable,  i.e.  as  being  sense-distances  so  small  that  the 
slightest  diminution  of  them  would  cause  them  all,  equally,  to 
cease  to  be  noticeable,  yet  as  noticed  or  perceived  would  appear 
of  different  magnitude  one  from  another.  Ebbinghaus  points 
to  the  analogous  case  of  differentials  or  infinitesimals  in 
mathematics.  These  are  all  equivalent  to  one  another  as 
being  all  equally  negligible  as  compared  with  finite  magnitudes, 
yet  are  by  no  means  necessarily  equal  to  one  another.  If  they 
belong  to  different  "orders,"  those  of  a  higher  order  are 
negligible  as  compared  with  those  of  a  lower,  etc.  Again,  "  the 
least  distances  perceived  as  such  at  different  parts  of  the  skin 
or  in  direct  and  indirect  vision  do  not  by  any  means  all  appear 
as  equal  magnitudes.  On  the  contrary,  so  soon  as  they  come 
to  consciousness  as  distances  they  are  at  once  perceived  as 
distances  of  varying  size,  in  a  certain  approximation  to  their 
objective  differences*."  In  spite  of  these  considerations, 
Ebbinghaus  regards  the  correspondence  of  the  stimulus  results 
obtained  for  equal  appearing  intervals  and  for  just  perceptible 
intervals  in  the  case  of  brightness-intensities  (in  the  middle 
region  of  the  scale  both  series  of  stimuli  form  a  geometrical 
progression)  as  sufficient  evidence  for  the  approximate  equality 
of  the  latter  intervals.  Muller  and  Wundt  had  previously 
advanced  the  same  argument. 


•  Ebbinghaus:  GrundzUge  der  Psychologie,  2nd  ed.,  1905,  p.  524. 
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Several  experimental  investigations  have  been  made  with 

the  express  purpose  of  testing  the  relation  of  the  methods  of 

minimal  change  and  mean  gradations.    Titchener*  sums  up 

the  theoretical  basis  of  such  experiments  concisely  as  follows : 

"  There  are  in  reality  two  possible  ways  of  working.    (1)  We 

might  take  a  series  of  i^-values,  corresponding  to  a  series,  say, 

of  eight  successive  j.  n.  d.  of  S,  and  thereafter  directly  compare 

the  two  half-distances,  of  four  j.  n.  d.  each,  and  decide  upon 

their  equality  or  inequality.    This  would  be  a  direct  method  of 

experiment :  determine  the  i?-difFerences  as  a  —  h,  b  —  c,  c  —  d, 

d  —  e,  e—f,  f—  g,  g  —  h,  h  —  i,  and  then  see  if  a  —  e  is  equal  in 

S  to  e  —  i.    Or  (2)  we  mighb  determine  a  few  j.  n.  d.  of  8,  at 

different  parts  of  the  ii-scale,  in  order  to  establish  the  constancy 

of  the  relative  BL,  and  thereafter  work  with  supraliminal  d., 

and  decide  whether  the  same  uniformity  holds.    This  would  be 

an  indirect  method :  it  is  the  method  indicated  by  the  authors 

just  cited  [Miiller,  Wundt,  Kohler,  and  Tannery].    Let  a-b 

and  h  —  i  be  i2-differences  corresponding  to  the  j.  n.  d.  of  S,  and 

PI       ^    •     T^T  ■     T  J.  i  —  h 

let  there  be  constancy  of  the  relative  JJL,  i.e.,  let         =  . 

If  the  j.  n.  d.  are  equal,  then  the  point  e,  which  lies  midway 

for  8  between  a  and  i,  must  be  such  that  — —  =  — 

e  =  Vai  Either  of  these  two  methods  would,  presumably,  take 
us  to  our  goal.  The  experimental  work  would  be  exceedingly 
difficult.  Liminal  determinations  are  always  and  intrinsically 
difficult;  and,  further,  the  judgments  passed  upon  j.  n.  d.  and 
upon  supraliminal  d.  are,  even  under  the  most  favourable 
conditions,  the  expressions  of  radically  different  mental  atti- 
tudes." 

It  should  be  mentioned  here  that  the  principal  rival  to 
Fechner's  hypothesis— the  "difference  hypothesis  "—is  that 
first  formulated  by  Plateau,  and  generally  known  as  the 
"quotient  hypothesis."  Plateau  adopted  the  psycho-physical 
formula 

8  =  cW   (5) 

*  "Experimental  Psychology,"  Vol.  ii.  Instructofs  Manual,  p.  Ixxvu. 
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on  the  basis  of  experiments  by  the  method  of  mean  grada- 
tions. This  implies,  in  the  place  of  Fechner's  fundamental 
formula,  the  formula 

dS_,dR   /g) 

in  other  words,  it  assumes  that  j.  n.  d.  are  relatively,  not 
absolutely  equal  >Sf-magnitudes.  Although  Plateau  himself 
withdrew  his  formula  later,  the  "quotient  hypothesis"  still 
remains  as  the  rival  of  Fechner's  "difference  hypothesis." 

To  return  to  the  experiments.  Merkel  (1888)  worked  with 
brightnesses,  pressures,  and  noises,  and  found  that  the  i^-value 
of  the  >S  bisecting  a  supraliminal  sense-distance  was  the 
arithmetic  mean  of  E-values  of  the  two  terminal  sensations, 

i.e.,  in  Titchener's  notation,  e  =  ^^-y~>  "^^^^  would 

seem  to  support  the  quotient  hypothesis,  although  such  an 
inference  is  not  entirely  free  from  objection.  Angell  (1892) 
worked  with  noise-intensities  by  the  method  of  mean  gradations, 
avoided  certain  sources  of  error  present  in  Merkel's  form  of 
procedure,  and  obtained  results  supporting  the  difference 
hypothesis,  e  =  V ai. 

A  more  thorough  investigation  was  carried  out  by  W.  Ament* 
in  1900.  He  used  a  series  of  Marbe  greys  for  the  brightnesses 
and  employed  the  direct  method ;  for  the  noise-intensities  he 
used  a  Fechner  sound  pendulum  and  worked  principally  by 
the  indirect  method.  The  result  reached  was  that  "  the  DL  as 
a  magnitude  increases  with  increasing  R,"  i.e.  it  supported  the 
quotient  hypothesis.  Titchenerf  summarises  Ament's  con- 
clusions as  follows : 

"  The  result  is  illustrated  by  the  following  Figure. 


^2    ^3     ^4  n 


Fig.  2. 

*  W.  Ament :  "  Ueber  das  Verhiiltnis  der  ebenmerklicheii  zu  den  iibermerk- 
licheu  Unterschiedeu  bei  Licht-  und  Scballinteusitiiten,"  Phil.  Studien  xvi 
1900,  pp.  135  fi. 

t  Op.  cit.  pp.  Ixxxv,  Ixxxvi. 
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The  values  r^,  i\,  r.^,  etc.,  are  E-intensities,  arranged  in 
ascending  order :  the  distance  between  every  pair  of  successive 
r  represents  a  DL.  The  distances  rj — r^,  r^—r^,  etc.,  in- 
crease by  equal  increments.  It  is  clear  (1)  that  the  distance 
— ^'fl,  though  it  contains  the  same  number  of  j.  n.  d.,  will 
appear  greater  than  the  distance  — ;  (2)  that  the  difference 
decreases,  the  nearer  the  compared  distances  approach  to  r^, — 
so  that  a  comparison  of  — with  rg — 1\  may  even  evoke 
the  judgment  'equal';  and  (3)  that  a  comparison  of  r-^—r.^ 
with  rg — r-g  will  evoke  a  decided  judgment  of  difference,  although 
the  two  distances  are  alike  j.  n.  d." 

Ament's  work  has  not  escaped  criticism,  A  repetition  of  his 
experiments  on  brightness-intensities  by  Frobes*  has  failed  to 
confirm  his  results.  Ebbinghaus  also  has  found,  in  careful  ex- 
periments with  rotating  sectors,  that  just-noticeable  differences 
in  different  parts  of  the  scale  of  brightnesses  are  equal  to  one 
another.  On  the  whole,  therefore,  the  balance  of  evidence  seems 
to  be  in  favour  of  the  "  difference  hypothesis." 

Fechner's  third  assumption — see  above,  p.  4 — brings  us  to 
the  question  of  the  interpretation  of  Weber's  Law. 
There  are  three  general  forms  of  interpretation : 

(1)  the  psycho-physical  (Fechner), 

(2)  the  physiological  (Mtiller,  Ebbinghaus,  James,  etc.), 

(3)  the  psychological  (Wundt). 

According  to  (1),  the  logarithmic  transition  takes  place  in 
passing  from  the  physiological  changes  in  the  sensory  centres  of 
the  cerebral  cortex  to  the  corresponding  sensation-intensities. 
The  chief  objection  to  this  view  of  Fechner's  is  that  Weber's 
Law  is  not  exact.  This  same  consideration  supports  (2),  or  the 
physiological  view,  according  to  which  the  transition  occurs  either 
at  the  inception  of  the  stimulus  in  the  sense  organ,  or  somewhere 
between  the  nerve  endings  and  their  central  connections  in  the 
sensory  areas  of  the  cortex.  Experimental  results  obtained 
by  Waller  t  and  SteinachJ  are  in  favour  of  this  view.  Waller 
*  Zeitschrift  fUr  Psychologie,  xxxvi.  p.  344. 

t  Waller:  "Points  relating  to  the  Weber-Feohner  L&vf,"  Brahi,  Vol.  xviii. 
p.  200,  1895. 

+  Steinach  :  "  Elecktromotorische  Erscheinungen  an  Hautsinuesnerven  bei 
adiiquater  Reizung,"  PJlUger's  Arch.  Bd.  63,  S.  495,  1896. 
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stimulated  a  frog's  eye  with  light  of  different  intensities,  and 
found  that  the  corresponding  "  negative  variations  "  set  up  in 
the  optic  nerve  varied  in  intensity  as  the  logarithm  of  the 
stimulus-values  (approx.).  Steinach  obtained  similar  results 
on  stimulating  the  skin  of  a  frog's  thigh  with  weights  and 
noting  the  negative  variation  in  the  attached  nerve.  If  the 
negative  variation  may  be  assumed  to  be  proportional  to  the 
intensity  of  the  nerve-current  passing  along  the  nerve,  these 
results  point  to  the  conclusion  that  the  logarithmic  transition 
occurs  in  the  sense  organ  and  its  sensory  nerve  endings. 

The  effect  would  thus  be  analogous  to  that  of  friction  and 
inertia  in  machinery.  Ebbinghaus*  has  constructed  a  theory 
based  upon  the  conception  of  varying  degrees  of  dissociability  of 
complex  molecules  to  account  for  the  law  and  also  for  the  devia- 
tions from  it  towards  the  two  extremes  of  the  intensity  scale. 

The  psychological  view,  (3),  of  the  law,  held  by  Wundt, 
regards  it  as  a  special  case  of  the  general  psychological  "  law 
of  relativity."  Stimulus,  physiological  process,  and  pure  sensa- 
tion-intensity increase  in  simple  proporbion  to  one  another. 
The  logarithmic  transition  occurs  in  passing  from  mere  sensation 
and  sensation  difference  to  apperceived  sensation  and  apper- 
ceived  sensation  difference,  and  the  intensities  are  apperceived 
always  in  relation  to  one  another.  In  addition  to  the 
objection  that  it  regards  the  sensation-intensities  themselves, 
not  their  distances  from  one  another,  as  measureable  magnitudes, 
this  view  is  also  open  to  the  criticism  that  it  furnishes  no  explana- 
tion of  the  widely  varying  size  of  the  relative  difference  limen 
in  different  sense-departments  (e.g.  DL  for  brightness-intensi- 
ties =  DL  for  sound-intensities  =  about  i);  the  physiological 
view  sees  in  the  varying  structure  of  the  different  sense-organs 
the  adequate  explanation  of  this.  Moreover,  the  psychological 
view  has  no  completely  satisfactory  explanation  to  give  of  the 
deviations  from  Weber's  Law  so  frequently  met  with. 

These  objections  and  difficulties  make  the  view  improbable, 
but  by  no  means  prove  it  to  be  impossible.    It  has  the  great 

*  Ebbinghaus  -.  "  Uber  den  Grund  der  Abweichungen  von  dem  Weber'schen 
Gesetz  bei  Lichtempfindungen,"  Pftiiger's  Arch.  Bd.  54,  S.  113,  1889. 
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merit  of  emphasizing  more  definitely  than  was  heretofore  the 
case  the  importance  of  the  more  purely  psychological  factors 
in  psycho-physical  experiments, — in  particular,  it  brings  into 
prominence  the  distinction  between  mere  disparity  of  sensation- 
intensities  present  simultaneously  or  in  immediate  succession 
in  the  same  consciousness  and  the  perception  of  this  disparity, 
the  discHmination  of  the  intensities  one  from  the  other.  In 
psycho-physical  experiments  the  subject's  consciousness  is  not 
limited  to  the  mere  sensational  level. 

Although,  with  continuous  increase  of  stimulus-intensities 
the  corresponding  sensation-intensities  rise  in  "  steps,"  each 
representing  a  just-noticeable  difference,  the  psycho-physical 
relation  is  really  a  strictly  continuous  one.  This  becomes  at 
once  obvious  if  we  consider  a  special  case.  The  sensation- 
intensity  aroused  in  lifting  a  wt.  of  100  grms.  is  indistinguish- 
able from  that  aroused  by  102  grms.,  the  sensation  aroused  by 
102  grms.  is  indistinguishable  from  that  aroused  by  104  grms. ; 
yet  the  sensation-intensity  aroused  by  100  grms.  is  perceptibly 
less  than  that  aroused  by  104  grms.  Thus  the  sensation- 
intensity  increases  continuously,  and  the  reason  why  this  is 
not  immediately  apparent  is  probably  to  be  looked  for  in 
the  physiological  mechanism  of  the  pyscho-physical  organism. 
Delboeuf  held  that  the  limen  has  no  psychological  importance 
whatever.  If  this  is  an  extreme  view,  the  importance  which 
Fechner  attributed  to  the  limen  is  equally  extreme  in  the 
other  direction,  and  farther  removed  from  the  truth. 

The  absolute  or  stimulus  limen,  RL,  is  similar  in  kind  to 
the  difference  limen,  since  consciousness  is  never  empty  of 
sensation-intensities  when  such  a  limen  is  being  determined. 
Here  again,  Fechner's  rigid  distinction  of  the  two  was  a  funda- 
mental error. 

Indirect  Methods  of  Measurement. 

The  preceding  account  has  probably  sufficed  to  show  that 
purely  psychical  measurement  is  a  possibility.  Its  practical 
application,  however,  has  been  more  detailed  than  extensive.  A 
more  generally  useful  method  in  quantitative  psychology  has 
been  found  to  be  that  which  measures  the  external,  physical  or 
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physiological,  causes  and  effects  of  mental  process.  The  measure- 
ments are  made  in  terms  of  the  physical  units  of  space  and  time, 
yet  they  are  not  merely  physical  measurements,  since  they  derive 
all  their  significance  from  the  correlated  psychical  processes. 
They  are  indirect  psychical  measurements*.  Measurements  of 
reaction-times,  memory,  fatigue,  illusions,  etc.  are  all  of  this 
nature.  Their  varieties  are  innumerable,  and  are  illustrated  by 
the  accounts  in  any  good  text-book  of  experimental  psychology 
(Sanford,  Titchener,  Myers).  In  all  cases  full  introspective 
accounts  are  essential,  and  when  correlated  with  the  measure- 
ments make  the  latter  essentially  psychical  measurements. 
Measurements  of  limina,  refen-ed  to  in  the  previous  section, 
are  of  the  same  nature.  They  are  of  some  special  importance 
as  being  measures  of  sensory  acuity,  etc. — aspects  of  the  total 
mental  ability  of  psycho-physical  organisms.  They  figure 
prominently  in  many  researches  based  on  the  use  of  "mental 
tests." 

A  method  which  makes  a  partial  return  to  the  more  purely 
psychical  form  of  measurement  in  terms  of  "distance"  is  the 
method  of  ranks  or  grades.  Suppose  we  are  considering  the 
relative  abilities  of,  say,  100  boys  in  English  Composition.  We 
should  find  it  difficult  to  mark  their  essays  individually  in 
terms  of  any  constant  unit  but  might  find  it  possible  to  arrange 
them  in  order  of  merit,  especially  if  we  had  sufficient  time  at 
our  disposal  to  employ  the  method  of  "paired  comparisons." 
According  to  the  procedure  of  this  latter  method,  the  essays 
would  be  taken  in  pairs,  quite  at  random,  and  the  better  essay 
of  each  pair  would  be  given  a  "preference  mark."  This 
procedure  would  be  repeated  again  and  again  until  every  essay 
had  been  compared  with  every  other  essay.  The  order  of 
merit  is  then  given  by  the  number  of  preferences  attaching  to 
each  essay.  In  this  order,  however,  we  cannot  assume  that  the 
"abihty-distance"  from  one  boy  to  the  next  is  a  constant 
quantity.  The  boys  near  the  extreme  ends  of  the  series  will 
be  farther  removed  from  one  another  than  the  boys  near  the 
middle.  We  could  only  adjust  for  this  if  we  knew  the  law  of 
frequency-distribution  for  this  kind  of  ability  in  this  particular 
*  See  Ebbinghaus,  Grundzilge  der  Psychologic,  1905,  pp.  75,  76. 
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species  of  boy,  and  theoretically  the  determination  of  this 
distribution  depends  upon  a  prior  fixing  of  the  psychological 
unit,  the  unit  of  "  ability-distance  " ;  so  that  strictly  speaking 
the  problem  is  insoluble.  Since,  however,  under  certain 
deiinite  and  indefinite  conditions,  the  form  of  distribution  in  a 
large  number  of  cases  of  physical  measurement  has  been  found 
to  be  Gaussian  or  "  normal,"  we  might,  with  some  probability 
of  being  near  the  truth,  assume  this  form  of  distribution  in 
the  given  case  and  so  obtain  a  quantitative  measure  for  the 
ability  of  each  particular  boy.  A  direct  psychological  determina- 
tion and  application  of  the  (conventional)  unit-distance  is  not, 
perhaps,  an  entirely  impossible  problem,  and  work  in  this 
direction  may  be  expected  in  the  future.  The  method,  if 
achieved,  would  certainly  be  much  more  scientific  and  psycho- 
logical than  the  present  one  which  measures  in  terms  of  external 
quantum  of  work  done. 

Finally,  the  interrelations  of  different  mental  abilities 
within  any  well-defined  group  of  individuals  situated  within 
any  definite  environment  may  be  determined  by  means  of 
the  technical  method  of  "  correlation."  A  correlation  coefficient 
or  other  constant  (e.g.  correlation  ratio)  measures  the  tendency 
■towards  concomitant  variation  of  two  mental  (or  other)  abilities 
within  a  group  of  individuals.  The  result  may  be  transferred 
to  any  single  individual  within  the  group  as  measuring  the 
degree  of  probability  of  connection  or  the  closeness  of  con- 
nection of  the  two  abilities  in  the  particular  case.  The 
correlation  between  the  two  abilities  may  be  due  to  an  actual 
direct  relation  of  the  abilities  to  one  another,  or,  indirectly,  to 
the  influence  of  a  common  external  environment  upon  them 
both.  The  first  of  these  two  cases  is  perhaps  the  more 
important,  but  the  possibility  of  the  second  should  not  be  lost 
sight  of,  and  it  also  has  a  special  interest  of  its  own.  The 
problem  of  correlation  will  be  considered  more  fully  in  a  later 
chapter. 


CHAPTER  II 


THE  PSYCHO-PHYSICAL  METHODS 

The  experimental  determination  of  absolute  and  difference 
thresholds  or  limina  is  complicated  and  difficult.  A  considerable 
number  of  physical  and  psychological  factors  are  involved,  of 
varying  relative  importance  in  different  cases.  The  result  is 
that  different  methods  of  procedure  have  been  found  most 
suitable  for  different  cases.  These  methods  have  been 
traditionally  grouped  under  three  (or  four)  distinct  headings, 
and  called  the  Psycho-physical  Methods.    They  are: 

(1)  the  Method  of  Limits  (Method  of  Minimal  Changes), 

(2)  the  Method  of  Average  Error  (Method  of  Production), 

(3)  the  Constant  Method  (Method  of  Right  and  Wrong 
Cases). 

A  fourth  method  is  generally  added  to  the  list,  viz. : 

(4)  the  Method  of  Equal-Appearing  Intervals,  or  Method 
of  Mean  Gradations,  but  this  is  really  no  new  method.  It 
owes  its  special  name  to  the  nature  of  the  task  which  it  fulfils, 
viz.  the  determination  of  equal-appearing  (iibermerklich)  sense- 
distances  as  distinguished  from  just  perceptible  (ebenmerklich) 
sense-distances.  The  method  which  it  employs  falls  under  one 
or  other  of  the  first  three  headings. 

1.    The  Method  of  Limits. 

In  using  this  method  for  the  determination  of  difference 
limina  the  following  mode  of  procedure  is  adopted.  The 
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variable  stimulus,  V,  is  first  made  equal  or  slightly  larger  than 
the  standard,  S,  and  then  gradually  increased  by  small  increments 
until  the  subject  finds  it  just  noticeably  greater  than  S.  F  is 
increased  still  more,  and  then  gradually  diminished  until  it 
just  ceases  to  appear  greater  than  S.  The  mean  of  the  two 
values  of  V—  S  thus  obtained  is  the  upper  difference  limen,  D„. 
Four  values,  instead  of  two,  might  also  be  obtained,  viz,  for 
(i)  Fjust  not  perceptibly  greater  than  (ii)  Fjust  perceptibly 
greater  than  S,  (iii)  V  just  perceptibly  greater  than  S,  (iv)  V 
just  not  perceptibly  greater  than  S,  (i)  and  (ii)  being  for 
ascending  values  of  V,  (iii)  and  (iv)  for  descending.  Du  will  be 
the  mean  of  these  four  values  of  V  —  S. 

The  lower  difference  limen,  Di,  is  obtained  in  a  similar  way. 
Both  limina  may  be  obtained  in  the  same  series  of  experiments 
by  the  "  method  of  complete  descent  and  ascent." 

A  series  of  determinations  of  each  limen  is  made  and  the 

average  taken.    The  reliability  of  this  average  measure  is 

conveniently  given  by  the  "mean  variation"  (m.v.),  which  is 

the  average  of  the  deviations  of  the  individual  determinations 

(these  deviations  all  being  reckoned  as  positive)  from  their 

average.    If  there  are  n  individual  determinations  cci,  x.^, ...  x^, 

with  average  x,  then 

(x  ~  x^)  +  (^  ~  iCa)  +  . . .  +  (a;  ~  a;,i) 

M.V.  =  ^  '  • 

n 

M.v.  thus  measures  the  "scatter"  of  the  individual  measures 
about  their  mean.  Other  measures  of  "  scatter  "  are  the  "  measure 
of  precision  "  Qi)  (inverse),  the  standard  deviation  {a)  and  the 
"probable  error"  (p.E.).  The  general  question  of  the  measure 
of  "  scatter  "  will  be  considered  later  on  in  this  book.  ^ 

The  number  and  size  of  the  increments  employed  must  be 
adjusted  to  the  particular  conditions  of  the  experiment.  The 
subject  of  the  experiment  should  be  given  a  certain  amount  of 
preliminary  practice  before  being  started  upon  the  work,  and 
introspective  reports  should  be  asked  of  him. 

Among  the  various  possible  sources  of  error  which  deflect 
the  subject's  judgment,  two  are  of  special  importance.  They 
arise  from  the  temporal  and  spatial  arrangement  of  the  com- 
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pared  stimuli,  and  are  called  the  "time  error"  and  "space 
error"  respectively.    Thus,  in  a  determination  of  the  difference 
limen  for  sound-intensities,  the  two  stimuli,  S  and  V,  cannot 
be  presented  simultaneously.    One  must  precede  the  other, 
and  iu  this  way  it  may  produce  a  slight  degree  of  fatigue 
which  causes  an  over-estimation  of  the  other,  or  it  may  produce 
the  reverse  effect  of  sharpening  the  attention  to  the  second. 
Thus  a  time  eiTor  arises.  Again,  in  experiments  with  brightness- 
intensities  or  visual  extents,  where  the  stimuli  can  be  presented 
simultaneously,  a  space  error  arises  from  the  fact  that  the  one 
stimulus  must  be  presented  either  to  the  right  or  to  the  left  of 
the  other  stimulus  and  the  subject's  judgment  varies  accordingly. 
In  experiments  with  lifted  weights  both  sources  of  error  may 
be  involved.     These  so-called  constant  errors  may  be  ap- 
proximately neutralised  by  arranging  that  in  the  course  of 
the  experiment  the  standard  shall  precede  the  variable  or  stand 
to  the  right  of  the  variable  in  half  the  cases  and  follow  or 
stand  to  the  left  in  the  other  half — the  time  or  space  order,  or 
both,  being  of  course  changed  quite  at  random  in  the  successive 
limen-determinations.    A  better  plan  is  to  evaluate  the  limen 
(and  its  M.v.)  for  each  time  and  space  order  separately.  This 
gives  us  the  values  of  the  time  error  and  space  error,  which  are 
of  interest  for  their  own  sakes.    Fechner's  theory  of  these  errors 
and  their  measurement  is  based  upon  the  assumption  that  the 
time  and  space  order  of  the  two  stimuli,  *S  and  V,  exert  an 
influence  upon  the  result  which  is  equivalent  to  a  definite  in- 
crease or  diminution  in  the  stimulus-value  of  one  or  the  other, 
and  thus  increase  or  diminish  the  value  of  ;S  ~  F  by  the 
amount  of  the  time  error,  p,  or  that  of  the  space  error,  q,  or 
that  of  both  together,  p  +  q,  which  =  c,  the  constant  total  error. 
The  time  error,  p,  is  positive  or  negative  according  as  the  effect 
of  the  time  order  is  to  increase  or  diminish  the  apparent  value 
of  the  first-presented  stimulus.    The  space  error,  q,  is  positive 
or  negative  according  as  the  effect  of  the  space  order  is  to 
increase  or  diminish  the  apparent  value  of  the  left-hand 
stimulus. 

Four  principal  cases  of  time  and  space  order  are  possible, 
and  are  conventionally  numbered  as  follows: 
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I.  Standard  presented  first  and  to  the  right, 

II.  „  „       second    „  „ 

III-  »        first       „    „  left, 

IV-  »  „        second    „  „ 

Employing  these  numbers  as  suffixes,  we  have  the  equations* 
2  i)„  =       +  D^iY  =  Dull  +  n^m 

2ci  =Dii  -Aiv  =i>«iv-i)«i 
2  c„  =  Din  -  Am  =  D^ui-  Duu 
In  the  case  where  a  difference  of  time  order  alone  is 
possible,  the  above  equations  reduce  to 

2  Du  =  Dui  +  Dun 

2Di  =  Dii+Diu 
2p  =  Dui- Dull  =  Dill-  Dn. 
Corresponding  equations  are  valid  for  the  space  error  q,  in 
a  case  where  a  difference  of  space  order  alone  is  present. 

The  errors  due  to  expectation,  habituation,  fatigue,  etc.,  are 
neutralised  or  at  least  reduced  to  a  minimum  by  determining 
the  limen  by  means  of  both  ascending  and  descending  values 
of  Fand  averaging,  and  by  other  special  precautions  in  applying 
the  method. 

A  modification  of  the  method  which  effects  a  more  complete 
elimination  of  the  expectation  error  by  the  use  of  occasional 
"  catch  experiments,"  where  the  variable  given  is  equal  to  the 
standard,  is  that  known  as  the  Method  of  Serial  Groups  f. 

Each  fixed  value  of  V  is  presented  with  the  standard  ten 
times,  not  in  immediate  succession  but  interspersed  at  random 
among  ten  other  values  of  V  equal  to  S.  The  percentage  of 
correct  answers  given  by  the  subject  is  noted,  and  the  experi- 
menter passes  on  to  the  next  value  of  V  which  is  presented  along 
with  catch  stimuli  in  a  similar  manner.  The  value  of  V  which, 
as  presented  in  this  way,  gives  80  "/o  right  answers  is  arbitrarily 
chosen  as  measuring  the  limen.    This  method  combines  with 

*  G.  E.  Miiller:  Die  Gesichtspunkte  und  die  Tatsachen  der  Psychophysischen 
Methodik,  1904,  pp.  67,  71. 

t  See  C.  S.  Myers:  Textbook  of  Experimental  Psychology,  p.  209. 
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the  principle  of  the  method  of  limits  that  of  the  constant 
method  (yet  to  be  described)  and  is  a  very  convenient  one  to 
use  in  measuring  a  large  number  of  subjects  in  "  mental  test " 
experiments,  where  economisation  of  time  is  essential. 

2.    The  Method  of  Average  Error. 

In  this  method  the  subject  is  required  to  adjust  a  variable 
stimulus,  V,  so  that  it  seems  subjectively  equal  to  a  given 
standard  stimulus,  s.  The  experiment  is  repeated  a  large 
number  of  times— at  least  100— and  the  arithmetical  mean,?;^, 
of  all  the  obtained  stimulus- values  is  calculated.  The  difference 
between  this  mean  value  and  the  standard  stimulus  is  known 
as  the  crude  constant  error,  e.    It  may  be  either  positive  or 

negative.  The  mean  variation,  M.V.  =  ,  is  also  cal- 
culated. 

The  crude  constant  error  may  be  partly  due  to  a  space  error 
(the  time  error  cannot  occur  in  this  method),  partly  to  other 
constant  conditions*.  Let  us  assume  that  the  experiment  is 
to  adjust  the  length  of  a  variable  line  until  it  seems  to  be 
equal  in  length  to  a  standard  line.  In  this  case,  the  standard 
should  be  situated  to  the  right  in  one-half  the  number  of 
adjustments  and  to  the  left  in  the  other  half,  either  alternately 
or  in  haphazard  order.  The  results  are  tabulated  in  two 
columns,  I  standard  to  right,  II  standard  to  left,  and  v^i,  '^mii 
are  calculated  from  these  two  columns  separately.  Then,  if  q  be 
the  required  space  error  and  k  the  residual  constant  error,  we 
have 

Vmi-  s  =  -  q  +  k,  Vmii  -  s  =  +  q  +  k, 

Vmll  —  ''^ml       J      Vmll  +  Vml 

i.e.  q  =  2  '   ^  =  2  

In  order  to  give  as  much  definiteness  as  possible  to  the 
task  of  adjustment,  the  variable  should  start  sometimes  shorter 
than  the  standard,  sometimes  (an  equal  number)  longer,  and 
the  adjustment  be  made  by  lengthening  and  shortening  re- 
spectively.   Again,  the  requisite  amount  of  lengthening  or 

*  Cf.  E.  B.  Titohener,  Experiviental  Psychology,  Vol.  ii.  Stiideitt's  Manual, 
p.  74.  "A  'constant'  error  is  simply  an  error  whose  conditions  are  constant; 
its  amount  may  vary,  quite  considerably,  from  stage  to  stage  of  a  long  series  of 
experiments." 

B.  2 
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shortening  should  be  arranged  to  be  different  on  different 
occasions,  but  alternating  with  some  degree  of  uniformity. 

The  value  of  m.V.  (mean  variable  error)  obtained  in  this 
method  is  from  a  general  point  of  view  a  more  important  result 
than  the  value  of  the  constant  errors,  since  it  has  often  been 
regarded  as  proportional  to  the  value  of  the  difference  threshold 
as  determined  by  the  other  two  psycho-physical  methods.  The 
truth  is  that  although  there  is  a  certain  amount  of  proportionality 
between  the  values,  this  proportionality  is  not  complete.  The 
correlation  of  mean  variable  errors  and  corresponding  just 
noticeable  differences  is  less  than  unity.  Under  certain  con- 
ditions the  two  values  vary  in  opposite  directions.  It  is  hardly 
necessary  to  point  out  that  the  M.  V.  obtained  in  the  Method  of 
Limits  has  but  little  relation  to  the  mean  variable  error  obtained 
by  the  Average  Error  Method.  They  are  not  entirely  unrelated, 
however. 

The  closest  correspondence  of  any  is  that  between  the 
"  frequency  distribution  "  of  the  errors  in  the  present  method, 
and  that  of  the  judgments  "equal"  obtained  by  the  Constant 
Method. 

3.    The  Constant  Method. 

This  is  generally  considered  to  be  the  most  satisfactory  of  the 
psycho-physical  methods.  It  can  be  employed  with  equal  conveni- 
ence for  the  determination  of  absolute  thresholds;  difference  thres- 
holds, equally-appearing  sense- distances,  and  other  measurements 
of  psychological  importance.  The  different  values  of  the  variable 
stimulus  to  be  employed  are  fixed  once  for  all  at  the  beginning 
of  the  investigation,  and  are  presented  to  the  subject  a  large 
number  of  times  (say  100  applications  of  each  variable  stimulus 
value)  in  irregular  order,  or  in  a  prearranged  order,  unknown  to 
the  subject, corresponding  to  certain  precautions  to  be  mentioned 
later.  If  an  absolute  threshold  is  being  determined,  the  variable 
is  presented  alone,  if  a  difference  threshold,  it  is  on  each  occasion 
preceded,  accompanied  or  followed  by  the  standard  stimulus. 
In  the  latter  case,  the  subject  returns  the  replies,  "  greater," 
"uncertain"  or  "equal,"  "less"  with  reference  either  to  the 
standard,  or  to  the  variable,  or  on  some  other  prearranged 
principle.    The  percentage  of  each  of  these  three  types  of 
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answers  is  determined  for  each  value  of  the  variable  employed, 
and  recorded.  The  required  threshold  is  taken  as  that  corre- 
sponding to  the  variable  which  evokes  50  "/o  objectively  correct 
answers,  and  consequently  also  50° /„  of  incorrect  and  "uncertain" 
(or  "  equal ")  answers. 

Taking  as  an  example  of  the  determination  of  an  absolute 
threshold  that  of  the  "  spatial  threshold,"  we  may  say  that  it  is 
given  by  50 "/o  two-point  judgments;  and  as  an  illustration  of 
the  detailed  working  of  the  method,  we  shall  find  it  convenient 
to  refer  to  a  series  of  results  obtained  by  Riecker  (Zeitschr.  f. 
Biologie,  Bd.  10)  which  has  already  served  in  the  descriptions 
of  the  method  given  by  Muller,  Titchener,  and  Myers.  Riecker 
obtained  the  following  results  in  an  investigation  of  the 
"spatial  threshold"  or  threshold  of  two-point  judgments  of  the 
skin  of  the  lower  eyelid  : 


D  (distance  between  the 
points  of  the  aestheso- 
meter  in  Paris  lines*) 

0 

0-5 

1 

1-5 

2 

3 

4 

5 

6 

^(%  of  two-pt.  judgments) 

30 

10 

14 

40 

65 

80 

87 

96 

100 

(-20)  (4)   (26)  (25)  (15)  (7)    (9)  (4) 

It  will  be  observed  that,  with  two  exceptions,  the  series  of 
percentages  follows  a  general  law  of  increase,  the  rate  of 
increase  itself  increasing  at  first  and  then  diminishing.  The 
numbers  within  the  brackets  are  formed  by  subtracting  each 
percentage  from  the  immediately  succeeding  one,  and  show 
this  uniformity  more  clearly.  The  two  exceptions  are  at  Z)  =  0 
and  at  i)  =  5.  At  D  =  0  the  percentage  is  greater  than  at  the 
immediately  succeeding  interval.  This  irregularity  is  known 
as  an  inversion  of  the  first  order.  The  cause  of  it  is  doubtless 
to  be  looked  for  in  the  exceptional  way  in  which  the  stimulus 
(one  point)  may  have  been  applied,  the  pressure  or  general 
nature  of  the  contact  may  have  been  different  from  what  they 
were  with  two-point  contact,  or  some  misleading  suggestion 
may  have  accompanied  these  particular  experiments. 

*  A  Paris  line  =  2-25  mm. 

2—2 
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At  D  =  5,  the  percentage  is  indeed  larger  than  that  for  jD  =  4 
and  smaller  than  that  for  D  =  6,  but  reference  to  the  bracketed 
numbers  shows  that  the  increase  from  4  to  5  is  greater  than 
the  increase  from  3  to  4,  thus  breaking  the  general  rule  as 
regards  rate  of  increase.  This  irregularity  is  known  as  an 
inversion  of  the  second  order,  and  being  in  the  present  case 
slight  is  probably  to  be  explained  as  due  to  an  insufficiency  in 
the  number  of  applications  of  the  stimulus. 

It  is  important  to  realise  clearly  the  fact,  which  G.  E.  Miiller* 
was  the  first  to  point  out  and  emphasize,  that  a  limen  is  a 
variable  magnitude  following  a  certain  law  of  frequency- 
distribution  f.  There  is  no  fixed  limen,  only  an  average  limen, 
a  most  frequent  limen  or  a  most  representative  limen.  The 
percentages  in  Riecker's  table  represent  the  relative  frequencies 
of  limina  for  distances  between  the  compass-points  less  than 
the  corresponding  D's.  Thus  there  are  65  Vo  limina  less  than 
2  Paris  lines,  80  "/o  limina  less  than  3  lines ;  i.e.  there  are 
80  -  65,  or  15  "/o  limina  between  the  limits  of  stimulus  values 
2  lines  and  3  lines. 

This  suggests  the  plan  of  plotting  a  "frequency-polygon" 
for  the  limina.  For  the  present  case  (omitting  the  anomalous 
percentage  value  for  0)  the  frequency-polygon  is  given  by  the 
accompanying  figure  (Fig.  3). 


a 


0-5 
— >• 


1     1-5  2 


Fig.  3. 


*  This  at  least,  is  the  view  to  which  Miiller  himself  inclines  {op.  cit.  p.  59), 
but  he  deduces  his  formulae  on  the  assumption,  supported  by  Fechner  and 
Bruns  that  the  threshold  has  a  single  definite  value,  D,  subject  in  the  course  of 
an  experimental  determination  to  variable  apparent  increase  or  decrease  by 
random  influences,  ±5,  which  obey  the  Gaussian  "  Law."  He  points  out  that 
both  views  lead  to  the  same  formulae. 

+  The  term  "frequency-distribution"  is  here  used  in  a  loose  sen^^e.  Its 
technical  significance  involves  the  conception  of  "  random  samples"  taken  from 
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The  area  of  each  rectangle  represents  the  percentage  or 
relative  frequency  of  the  limina  lying  between  the  stimulus- 
limits  indicated  by  the  abscissae  of  the  feet  of  the  two  perpen- 
diculars forming  its  vertical  sides.  Thus  the  area  of  abed  is  15. 
The  irregularity  of  the  frequency-polygon  or  "curve"  is  probably 
due  to  the  relatively  small  number  of  observations  recorded. 
Taking  the  limit  of  an  exceeding  large  number  of  (hypothetical) 
observations  or  cases  we  might  have  obtained  one  or  other  of 
the  two  general  types  of  frequency-curve  depicted  in  Fig.  4: 

PRS 


M   N  T 

Fig.  4. 


.4  is  a  symmetrical  curve,  B  is  known  as  a  skew  curve.  In 
both  cases  the  distribution  of  cases  (spatial  limina  in  the 
present  instance)  is  represented  by  the  area  enclosed  by  the 
curved  line  and  the  horizontal  line  or  axis  of  abscissae. 

There  are  three  ordinates  of  a  frequency-curve  which  are 
of  special  interest.    They  are : 

1.  the  maximum  ordinate,  whose  abscissa  is  the  mode; 

2.  the  centroid  vertical,  the  ordinate  passing  through  the 
"  centre  of  gravity  "  of  the  surface,  whose  abscissa  is  the  mean ; 

3.  the  ordinate  dividing  the  surface  into  two  equal  areas, 
whose  abscissa  is  the  median. 

a  "  total  population,"  and  is  explained  more  fully  in  the  last  section  of  this 
chapter.  The  description  on  pp.  21,  22  really  belongs  to  this  latter  account,  but 
is  inserted  here  because  necessary  for  a  proper  understanding  of  what  follows. 
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In  symmetrical  curves,  these  three  ordinates  coincide.  In 
asymmetrical  or  skew*  curves  they  do  not.  Thus,  in  Fig.  4,  B, 
PM  is  the  ordinate  corresponding  to  the  mode,  RN  that 
corresponding  to  the  median,  and  ST  that  corresponding  to 
the  mean.  The  abscissa  of  any  one  of  these  three  ordinates 
would  give  us  a  definite  representative  liminal  value.  In 
fixing  upon  50  Vo  two-point  judgments  as  the  measure  we 
choose  the  median.  Where  interpolation  is  justifiable,  i.e.  in 
results  where  the  number  of  limina  determined  is  sufficiently 
large,  so  that  a  continuous  curve  can  be  found  to  "fit"  the 
results  with  sufficient  closeness,  the  mode  of  this  curve  is  the 
best  measure  since  its  abscissa  gives  the  most  probable  or  most 
frequent  limen. 

The  "  degree  of  scatter  "  of  the  individual  limina  about  the 
modal,  mean,  or  median  limen  can  be  measured  either  by  the 
"standard  deviation,"  a,  or  by  the  "mean  variation,"  m.v. 
Restricting  our  attention  for  the  moment  to  the  symmetrical 
curve,  where  mode,  mean  and  median  coincide,  we  have  as  the 
value  of  0-,  the  square-root  of  the  mean  of  the  squares  of  the 
deviations  of  the  individual  values  from  the  mean  {mode  or 

median)  of  all  the  values ;  or  in  symbols,  a  =  /y/ ^ ,  where 

iV  is  the  total  frequency  (total  number  of  limina),  x  is  the 
deviation  of  any  individual  value  from  the  mean,  /  the  frequency 
of  each  group  of  equal  values  and  S  is  the  symbol  for  summa- 
tion (i.e.  S  {fx^)  =fxi^  +  fiX^  +  . . .  +  fN^N%  The  mean  variation 
has  already  been  explained  (p.  14).  Another  measure,  called  the 
"  probable  error,"  P.E.,  is  in  common  use.  It  =  "67449  <t,  and 
when  employed  in  a  general  way,  i.e.  for  all  kinds  of  skew  and 
all  kinds  of  symmetrical  curves,  may  be  regarded  as  merely  a 
conventional  reduction  of  the  standard  deviation.  It  only  has 
a  peculiar  significance  of  its  own  in  the  case  of  the  Gaussian  or 
"  normal "  curve,  in  which  it  corresponds  to  the  ordinate  which 
divides  either  half  of  this  symmetrical  curve  into  equal  areas, 
so  that  half  the  total  number  of  cases  are  included  within 
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the  limits  +  p.e.  In  Fig.  4,  curve  A,  the  area  of  the  curve 
between  the  ordinates  ef  and  gh  would  be  half  the  entire  area 
if  the  curve  were  a  "  normal "  curve. 

In  the  case  of  a  skew  curve,  these  measures  of  "scatter"  can  be 
taken  about  any  one  of  the  three  "  central  tendency  "  measures. 
They  are  also  different  for  each  half  of  the  curve,  although  it 
is  usually  sufficient  to  lump  the  two  values  together. 

It  is  most  important  to  distinguish  carefully  the  measure 
of  "central  tendency"  from  the  measure  of  "scatter."  Both 
should  be  evaluated  in  any  piece  of  research.  The  second  is 
a  measure  of  the  reliability  of  the  first,  or  at  least  of  the 
variability  of  the  individual  values  of  which  the  fir.^t  is  the 
average  or  representative  value. 

To  return  to  Riecker's  table.  The  limen  corresponding  to 
50  "/o  two-point  judgments  may  be  calculated  in  two  general 
ways :  (A)  from  the  observed  values,  (B)  by  finding  the  best- 
fitting  smooth  curve,  adjusting  the  observations  to  this,  and  then 
calculating  the  constants  (mode,  etc.,  a,  etc.)  from  the  curve. 

A.  The  required  limen  obviously  falls  somewhere  between 
1-5  lines  (40  "/o  two-point  judgments)  and  2  lines  (65  %)•  To 
determine  its  value  roughly  we  may  substitute  in  the  following 
formula,  devised  by  Wundt  in  another  connection  : 

where       =  distance  giving  Z^  Vo  two-point  judgments 

and      Za  is  the  percentage  just  above  50,  in  the  table, 

^&  "   >'  »         „    below  50,  „  „ 

Substituting, 

^^2  (50 -40) -I- 1-5  (65-50) 
65-40 
=  1  "7  Paris  lines. 

This  method  is  open  to  several  objections : 

(1)  It  does  not  employ  all  the  data;  it  uses  two  of  the 
percentages  only. 
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(2)  It  assumes  that  between  the  values  Da  and  Di  the 
frequency-curve  runs  parallel  to  the  axis  of  abscissae. 

(3)  It  gives  no  measure  of  scatter. 

A  value  which  can  be  determined  by  a  use  of  all  the  data 
is  that  of  the  mean  or  average  limen.  The  formula  to  be 
employed  is,  of  course,  the  well-known  formula  for  the  mean, 

N  ' 

where       x  =  abscissa  of  centre  of  base  of  any  column, 

/=  area  of  corresponding  column,  i.e.  the  relative 
frequency, 

and        N  =  total  relative  frequency. 
Applied  to  our  example,  this  gives 

75  X  4+1-25  X  26  +  1-75  x25-h2-5xl5-h3-5x 7  +  4-5  X  9  +  5-5x4 


D  = 


4+26  +  25  +  15  +  7  +  9  +  4 
203-75 


90 


=  2-26. 


It  has  been  recently  suggested  by  Dr  Spearman*  that  the 
most  representative  ordinate  of  a  column  (that  about  which  the 
limina  of  the  column  may  be  most  accurately  assumed  to  be 
concentrated)  is  not  the  mid-ordinate,  as  is  assumed  in  the 
above  formula,  but  an  ordinate  shifted  slightly  towards  the 
higher  of  the  two  neighbouring  columns  (see  footnote f).  He 
therefore  proposes  to  replace  /  of  the  preceding  formula  by  a, 
where 

_  n  ,  Dic^.-Dic  2A+.  +  3/,+/fc-, 

*  The  Method  of  "Eight  and  Wrong  Cases"  ("Constant  Stimuli")  without 
Gauss's  Formulae,  Brit.  Journ.  of  Psychology,  Vol.  ii.  1908,  pp.  227—242. 

t  Figure  given  by  Spearman,  op.  cit.  p.  241  (slightly  simplified). 

The  mid-ordinate  Cj,  is  replaced  by  the  ordinate  a*  which  passes  through 
the  centroid  of  the  trapezium  Dkl/kyk+i^k+i- 

is  the  mid-point  of       {  =  i  (A-i  +/*) }  > 

Vk+i      >'         "        "  Vk+iy 

average  relative  frequency  for  kth  column. 
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There  is  clearly  some  serious  misconception  here,  since  it 
has  long  been  known  that  the  mean  of  any  frequency-distribution 
needs  no  correction*.  A  glance  at  the  proof  (see  footnote) 
reveals  the  source  of  the  error.  Spearman  replaces  each  rect- 
angle by  a  trapezium  of  different  area  and  then  multiplies  the 
abscissa  of  the  centroid  vertical  of  this  new  area  by  the  old 
area,  which  area,  however  arranged,  could  not  possibly  have 
the  centroid  of  the  new  one.  The  unadjusted  value  is  the  best 
value  of  the  mean,  but  it  is  doubtful  whether  this  is  of  much 
significance  as  a  measure  of  the  limen. 

B.    Assuming  "  normal "  distribution  (Fechner,  Miiller). 

The  classical  description  of  this  method  is  given  in 
G.  E.  Miiller,  Die  Gesichtspunkte  and  die  Tatsachen  der  psycho- 
physischen  Methodik,  pp.  45 — 50,  56 — 58,  and  the  following 


This  gives  at  once  the  abscissa 


Dt+i-St 


xydx 


Jo  y^"" 

which  reduces  to  the  required  formula, 


3       •/m  +  2A+/fc-a- 


fk 


V  ,*  ^'       S^eppard:  "On  the  Calculation  of  the  most  Probable 

Values  of  Frequency  Constants,  etc.,"  Froc.  Loud.  Math.  Soc,  Vol.  xxix.  353  ff  • 
and  also  below,  p.  38.  >  • . 
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account  aims  at  being  merely  a  condensation  of  this  description. 
But  first  a  few  words  of  preliminary  explanation.    The  form  of 


Fig.  5. 


distribution  of  the  individual  limina  is  assumed  to  approximate 
with  a  fair  degree  of  closeness  to  the  so-called  Gaussian  or 
normal  curve,  or  "  curve  of  error,"  the  equation  to  which  is 
generally  written  in  the  form 

h 


y 


1«2 

'2  0-2 


(S  is  sometimes  put  in  place  of  x,  to  represent  "error":  see  later). 

A  more  convenient  form,  however,  is  that  employed  by  the 
biometricians,  viz. 

in  which  a,  as  usual,  =  the  standard  deviation 
and        N  =  total  number  of  cases  (area)  of  the  dis- 

tribution. 

N  may  be  taken  as  unity,  in  which  case  the  correspondence  of 
the  two  forms  gives  us  the  relation 

'  \/2.o- 

Thus  h  is  an  inverse  measure  of  the  "scatter"  of  the  frequency- 
surface.    It  is  called  the  "  measure  of  precision." 

Its  relation  to  the  mean  variation  M.V.  (sometimes  known 
as  the  average  deviation,  a.d.)  is  given  by  the  equation 

1 

M.V.  =  — =  . 


IT 


THE  PSYCHO-PHYSICAL  METHODS 


27 


Thus,  again,  in  the  case  of  the  normal  curve, 


o-  =  A/  |.M.v.  =  1-2533 M.v. 


•4769 

Finally,  p.e.  =  -67449  a  =  —j- . 

Obviously  the  mode,  mean,  and  median  coincide. 

The  ordinates  corresponding  to  x  =  ±a-  cut  the  curve  in  the 
two  points  of  inflexion  (i.e.  the  points  where  the  curve  changes 
from  concavity  to  convexity,  or  vice  versa). 

The  mathematical  deduction  of  the  equation  of  the  pro- 
bability curve  is  based  upon  the  general  theory  of  independent 
probability. 

It  is  developed  from  the  binomial  expansion  of  (p  +  viz. 

{71.   X  )  7h  \  7l/  1  ) 

the  terms  of  which  represent  the  distribution  of  chances  of  n 
independent  events,  where  the  probability  of  occurrence  of  each 
single  event  in  one  trial  is  p  and  the  probability  against  its 
occurrence  is  q  (i.e.  p  +  q  =1).  The  first  term  is  the  probability 
that  all  will  happen,  the  second  that  n  —  1  will  happen  and 
1  fail,  the  third  that  w  — 2  will  happen  and  2  fail,  and  so  on. 
The  single  events  must  be  independent  of  one  another.  Thus 
if  n  teetotums  are  taken,  each  of  which  is  marked  in  the  ratio 
of  ^  to  q,  so  that  the  chance  that  any  teetotum  will 
fall  on  the  marked  piece  is  p,  then  the  distribution 
of  chances  when  these  n  teetotums  are  spun  N 
times  is  given  by  the  terms  of  the  expansion, 
N{p-\-  qY.    In  the  stock  example  of  coin  spinning,     ^'ig-  6. 

It  can  be  proved  that  the  Gauss  curve  is  an  extremely  close 
approximation  to  the  binomial  frequency  polygon 

(1)  whatever  be  the  value  of  n,\i  p  =  q  =  ^, 

(2)  with  inequality  of  p  and  5,  provided  that  neither  p  nor 
q  is  very  small,  and  that  n  is  very  large. 
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It  is  not  in  agreement 

(1)  if  ^  or  g  is  very  small, 

(2)  if  there  is  correlation  between  the  events  or  contri- 
butory causes,  i.e.  if  the  events  are  not  completely  independent. 

The  standard  deviation  (a)  of  the  binomial  frequency  =  ^npq, 
that  of  the  Gaussian  =  \/(n  +  l)pq. 

The  above  brief  account  must  sufBce  for  the  theoretical 
discussion  of  the  Gauss  curve*.  Returning  to  Mtiller's  method, 
we  find  it  assumed  that  the  successively  increasing  percentages 
of  two-point  judgments  correspond  to  the  increasing  area  of  the 
portion  of  a  Gaussian  frequency-distribution f  included  between 
a  limiting  ordinate  OP  (see  accompanying  figure),  the  axis  of  x, 
the  curve  and  the  moving  ordinate  pD.  Thus  the  area  Pp^Dfi 
represents  the  percentage  of  two-point  judgments  given  for  an 


objective  distance  between  the  compass  points  represented  by 
the  abscissa  OA,  the  area  Pp^D^O  represents  the  percentage 
for  a  distance  OD^.  Our  task  is  to  determine  the  distance  OA 
which  gives  50%  two-point  judgments,  represented  by  half  the 
frequency-surface,  viz.  the  area  PpsDfi.  Let  us  call  the  average 
limen  (represented  by  OA  in  the  figure)  8,  and  any  particular 
stimulus-value  employed,  D.  Then,  in  a  sufficiently  large 
number  of  experiments  with  D>8  (e.g.  =  OA  in  figure),  so  that 

•  For  a  very  clear  elementary  exposition,  see  Titcliener,  Experimental 
Psychology,  Vol.  ii.  Student's  Manual,  pp.  38-55,  on  "  The  Law  of  Error."  Also 
M.errima,n,  Method  of  Least  Squares,  Tpp.  &— 25.  _ 

t  As  stated  above,  p.  20,  footnote,  there  is,  strictly  speaking,  no  ]«Btification 
for  treating  these  percentages  as  parts  of  a  "  frequency-distribution.  This  will 
be  made  more  clear  later  on. 
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the  percentage,  Z,  >  50,  taking  the  whole  area  of  the  surface 
as  1,  we  have 

^=  area  Pp^DoO  =  area  PpsD.O  +  area  p,p,D^B, 

Vtt  J  0 

or,  putting  hB  =  t, 

V7r-/o 

Similarly,  with  D<S  (e.g.  =  OD,  in  figure),  we  have 

^  =  area  PpiD^O  =■  area  PpsDsO  -  area  p^psD^Di 

Vvr-'  -(S-D) 

1 


=  1-^ 

VtT^  -[S-D)h 

Turning  now  to  our  experimental  results,  if  we  omit  the 
first  and  last  series  (i)  =  0  and  Z)  =  6)  for  reasons  previously 
given,  viz  that  30  "/o  for  i)  =  0  is  an  inversion  of  the  first  order, 
and  100  "/o  for  D  =  6  shows  that  the  limits  of  fluctuation  of  S  have 
been  passed,  and  call  the  other  D's  D^,  D.2,...Dt,  and  the  corre- 
sponding percentages  Z-^,  Z^,...Zt,  then  the  values  t^,  ti,...^ 
corresponding  to  the  Z's  are  given  by  Fechners  Fundamental 
Table  (see  Appendix  I,  Table  I) ;  and  we  have  the  equations : 

t,  =  (D,-S)  h, 


t,  =  (Dr  -  S)  h, 

which,  if  ^l,  t^,  etc.  were  "observed"  values,  could  be  solved  for 
S  and  h  by  the  Method  of  Least  Squares. 

But  these  lvalues  are  not  observed  values;  they  are 
theoretically  deduced  from  the  observed  2^- values.  Hence  we 
must,  as  it  were,  change  them  into  observed  values  by  "  weight- 
mg"  them.  They  are  to  be  weighted  both  by  tu',  a  weight 
proportional  to  the  number  of  observations  upon  the  basis  of 
which  the  corresponding  Z's  were  determined  and  also  by  w" 
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to  change  them  into  observed  values.  Thus  t,  is  to  be  multi- 
plied by  w,'w^\  U  by  ^m/',  and  so  on.  Since,  however  the 
number  of  observations  is  almost  universally  equal  in  each 
series  of  experiments  (giving  the  different  Z'b),  w'  is  in  almost 
all  cases  =  1,  so  that  w"  is  alone  needed.  This  is  given  by 
MiXller's  Table  of  Coefficients  of  Weights*  (see  Appendix  I,  Table 
II).    We  thus  have  the  equations : 

w,t,  =  (D,-S)h, 
w^t^  =  (Da  -  S)  h, 


which,  solved  by  the  Method  of  Least  Squares,  give  the 
"  normal "  equations  : 

t  (D^w)  .h-t  {Dw)  .Sh  =  X  (Ltw), 

-2(Dw).h+-%(w).Sh  =  -l(tw}, 

where  2  (D^w)  =  (D,'w,  +  D,'w,+  ...+  Dj'w,),  and  similarly  with 
the  others. 

Titchener  has  worked  out  Riecker's  results  by  this  method, 
and  gets 

A  =  0-49,  ^=l-88. 

In  Fechner's  original  discussion  of  this  method,  there  was 
no  sufficiently  clear  distinction  made  between  the  quantities 

S  and  h.    It  was  assumed  that  8  and  \  measure  much  the 

k 

same  quantity,  i.e.  that  the  product  Sh  is  a  constant.  Miiller 
was  the  first  to  distinguish  clearly  between  the  two  measures 
in  psycho-physics  and  to  point  out  that  8h  is  by  no  means 
always  a  constant  quantity,  although  approximating  to  it  in 
certain  cases.  S  and  h  do  not  always  vary  inversely  with  one 
another,  in  a  series  of  determinations  of  a  particular  threshold. 
In  some  cases  they  show  a  tendency  to  vary  directly. 

The  same  method  is  applicable  to  the  determination  of 
a  difference-limen.    In  Miiller's  form  of  the  method  the  most 

*  A  more  correct  table  is  that  suggested  by  F.  M.  Urban,  which  is  also 
given  in  Appendix  I. 
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representative  limen  is  taken  to  be  that  corresponding  to  50  "/o 
"right  answers"  (and  therefore  also  50 "/o  wrong + uncertain  or 
"equal"  answers).    The  formulae  required  are: 

1  r{±D-DLu)hu 

1  =  1,+^  e-m, 

U  = 


VttJ  {±D-DLu.)h.u, 

where  answers  "greater"  ^  =  less,     =  equal  or  un- 

certain, DL,,  =  upper  difFerence-limen,  DLi  =  lower  difference- 
limen,  D  =  any  particular  stimulus-difference  (V  -  S)  employed, 
and  is  -|-  or  -  according  as  the  variable,  V,  is  greater  or  less 
than  the  standard,  S. 

hu,  hi  are  the  measures  of  precision  of  the  upper  and  lower 
limen,  respectively. 

The  results — DL's  and  h's — are  obtained,  as  before,  by 
means  of  Fechner's  Fundamental  Table  and  Miiller's  Table  of 
Coefficients  of  Weights*, 

Where  time-errors  or  space-errors ■}•  or  both  are  possible, 
the  values  of  DL  and  h  are  determined  separately  for  each 
time  or  space  order.  In  these  cases  the  distributions  often  tend 
to  be  "  skew,"  even  when  the  results  all  lumped  together  give 
an  approximation  to  the  Gaussian  curve.  Indeed  Miiller  him- 
self emphasizes  the  unjustifiability  of  the  assumption  that 
all  distributions  obtained  by  the  constant  method  must  be 
normal  or  even  must  be  symmetrical,  and  shows  that  for  certain 
of  Merkel's  results^  a  "  zweiteiliges  Gausssches  Gesetz"  gives 
a  very  good  fit.  This  "  bipartite  "  or  compound  Gaussian  law 
differs  from  the  ordinary  one  simply  in  having  two  h's  or 
measures  of  precision,  one  for  the  positive  deviations  of  the 
individual  liminal  values  from  their  central  value,  and  another, 
a  different  one,  for  the  negative  deviations.    For  a  brief 


*  Better,  Urban's  Table, 
t  See  above,  pp.  15,  16. 
X  G.  E.  MuUer,  op.  cit.  pp.  92,  93. 


32 


THE  PSYCHO-PHYSICAL  METHODS 


criticism  of  the  law  of  distribution  suggested  by  Bruns,  and 
known  by  his  name,  the  reader  is  referred  to  Mtiller,  op.  cit. 
pp.  90,  91.  The  whole  question,  though  important,  is  too 
difficult  to  come  within  the  scope  of  this  book. 

Correspondence  of  Results  obtained  by  the  Three 
Psycho-Physical  Methods. 

It  has  already  been  pointed  out  that  the  average  error  or 
mean  variation  of  the  Method  of  Average  Error  corresponds 
closely  to  the  mean  variation  obtained  from  the  distribution 
of  "  equal "  or  "  uncertain "  answers  in  the  Constant  Method. 
The  relation  existing  between  determinations  of  liminal- values 
by  the  Limiting  Method  and  Constant  Method,  respectively, 
is  more  complex,  and  may  be  discussed  from  two  somewhat 
distinct  points  of  view,  viz.  the  mathematical  and  the  psycho- 
logical. Assuming  identical  or  approximately  identical  psycho- 
logical conditions  in  the  employment  of  the  two  methods,  the 
result  (value  of  the  limen)  will  be  the  same  if  the  chance 
values  of  the  limen  form  a  normal  frequency-distribution, 
different  if  this  frequency-distribution  is  skew.  In  the  latter 
eventuality  the  limen  given  by  the  Limiting  Method  will,  it  is 
clear,  be  greater  or  less  than  the  limen  given  by  the  Constant 
Method  according  as  the  frequency-distribution  has  its  mode 
towards  the  left  (zero  end)  or  towards  the  right.  In  the 
ordinary  forms  of  the  two  methods,  however,  the  psychological 
conditions  must  be  very  dilferent.  They  can  be  approximated 
to  one  another  by  getting  the  subject,  in  the  Constant  Method, 
to  reply  "much  larger,"  "much  smaller"  as  well  as  "larger," 
"smaller*."  In  calculating  the  result  only  the  last  two  series 
of  answers  are  employed. 

It  has  generally  been  held  hitherto  that  the  Constant 
Method  gives  more  reliable  results  than  the  Method  of  Limits, 
but  F.  M.  Urban  t  has  recently  brought  forward  reasons  for 
*  Cf.  Ebbingbaus:  GrundzUge  der  Psychologic,  1905,  p.  89;  but  see  Miiller, 
op.  cit.  p.  160. 

t  F  M  Urban-  "On  the  Method  of  Just  Perceptible  Differences,"  Psyc/io- 
logical  Revieiv,  Vol.  xiv.  1907,  pp.  244-253.  See,  too,  Vol.  xvii.  1910,  "  The 
Method  of  Constant  Stimuli  and  its  Generalizations,"  pp.  229— 2o9,  by  the 
same  author. 


THE  PSYCHO -PHYSICAL  METHODS 


33 


believing  that,  under  certain  conditions  of  experimentation,  the 
converse  is  the  case.  He  employs  the  following  formula  for 
the  measure  (M)  of  the  limen : 

M  =  ^  {r,P,N  +  r,P,N  +...+  VnPnN)  =  r,P,  +  r^P,  +  . . .  +  r^P,,, 

where  7-i  <  7-3  <  . . .  <  Vn  are  the  successive  values  of  the  variable- 
stimulus  (F)  employed,  and 

P,=i?i, 

P2=  qiPi, 

P?i  ~  9'i9'2  ■  •  •  Qn—iPnt 

where,  again,  the  ^s  and  q's  need  to  be  explained  ;  thus 

pic  is  the  probability  that  in  the  comparison  of  the  stimulus  Vk 
with  the  standard  the  judgment  "greater"  will  be  given, 

and 

is  the  probability  that  a  judgment  will  be  given  which  is  not 
a  "greater "  judgment,  and  therefore  =  1  -p^. 

N  =  the  number  of  series  of  experiments  made. 

This  modified  form  of  the  method  evidently  approximates 
closely  to  the  Constant  Method.  The  p's  are  the  percentage- 
values  obtained  in  the  latter.  The  P's  are  more  reliable, 
though  smaller,  than  the  p's.  Pk  is  a  measure  of  the  component 
probability  that  is  judged  "greater"  and  that  on  all  the 
smaller  r  (rj . . .  v/c-i)  judgments  are  given  which  are  not  "greater" 
judgments.  Since  the  value  of  the  P's  is  not  changed  by  the 
order  in  which  the  r's  are  presented,  these  latter  may  be 
presented  in  any  order,  thus  eliminating  the  influence  of 
expectation,  and  bringing  the  method  very  close  indeed  to  the 
Constant  Method.  Another  important  point  is  "  to  vary  the 
steps  'by  which  one  approaches  the  threshold,'  because  other- 
wise one  can  not  make  the  supposition  of  a  symmetrical 
distribution"  (p.  247).  Urban  compared  the  results  obtained 
by  means  of  his  formula,  using  lifted  weights,  with  the  results 
obtained  empirically  by  (as  it  were)  directly  observing  P^,  or 

B-  3 
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something  proportional  to  Pk,  viz.  Nk,  "  the  number  of  times 
it  occurred  that  each  weight  [in  this  case,  was  the  lightest 
weight  of  the  entire  series  to  be  judged  'heavier'"  (p.  249). 
The  agreement  of  observed  results  and  theoretical  results  was 
found  to  be  very  close  in  all  the  cases.  The  article  concludes 
as  follows:  "The  theoretical  basis  of  the  method  of  just 
perceptible  differences  is  the  same  as  that  of  the  error  method, 
namely  empirical  determinations  of  the  probabilities  of  judgments 
of  different  types  on  given  differences  of  intensity.  The  result 
of  the  so-called  method  of  just  perceptible  differences  is  that 
amount  of  difference  for  which  there  exists  the  probability  one 
half  that  it  will  be  recognised"*  (p.  253). 

One  aspect  of  the  psycho-physical  methods  has  been  scarcely 
mentioned  in  all  the  preceding  description,  viz.  the  attitude 
of  the  subject.  Its  importance  is  hardly  likely  to  be  overlooked 
in  the  present  state  of  opinion  in  psychology.  It  is  superfluous 
to  emphasize  the  need  of  careful  introspective  reports  (given 
either  at  the  end  of  the  experimental  series  or  after  each 
observation)  in  all  quantitative  determinations,  liminal  and 
supraliminal  alike. 

The  degree  of  "  fore-knowledge  "  of  the  subject,  the  form 
in  which  he  is  asked  to  give  his  judgment — in  terms  of  standard 
or  variable,  of  right-hand  or  left-hand  stimulus,  etc. — and 
other  recognisable  psychological  factors  directly  bearing  upon 
the  quantitative  determination,  should  be  varied  in  as  many 
ways  as  possible  and  the  corresponding  results  noted ;  the  effects 
of  practice,  fatigue,  "  contrast,"  "  side-comparisons,"  "  absolute 
impression,"  etc.  should  be  reduced  to  a  minimum  or  else 
rendered  constant  and,  where  possible,  measured ;  in  short,  the 
quantitative  determinations  should  always  be  made  for  the  sake 
of  the  psychological  processes  involved,  in  the  fullest  sense  of  the 
phrase,  and  not  conversely.  The  problems  of  the  psychological 
processes  of  discrimination,  judgment,  etc.  thus  raised  are  of  the 
first  importance,  but  their  discussion  would  take  us  beyond 

*  Of.  Kiilpe :  the  probable  value  of  the  limen  is  "  that  stimulus  difference 
which  is  just  as  often  cognized  (correctly  judged)  as  not  cognized  (incorrectly 
judged)  "—quoted  from  Sanford;  Experimental  Psychology,  footnote,  p.  351. 
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the  scope  of  this  book.  We  can  only  just  mention  them  here, 
emphasize  their  importance,  and  indicate  the  relation  in  which 
they  stand  to  mental  measurement  in  the  narrowest  sense. 

Frequency-Distributions. 

We  have  seen  above  that  the  graphical  representation  of 
the  percentages  of  any  one  of  the  three  kinds  of  judgments 
obtained  in  the  Constant  Method,  although  sometimes  approxi- 
mating to  the  form  of  a  Gaussian  frequency-distribution,  is  in 
many  cases  decidedly  "  skew."  In  these  latter  cases  the  most 
representative  limen  would  be  given  by  the  mode  of  the  smooth 
curve  which  best  fits  the  figure.  Unfortunately  the  determina- 
tion of  the  equation  of  such  a  best-fitting  curve  must  be 
almost  entirely  empirical  in  nature,  since  we  have  so  slight 
a  theoretical  basis  on  which  to  build.  Urban's  notion  of  the 
probability  of  a  judgment,  illustrated  in  the  preceding  section, 
seems  to  be  the  most  promising  as  a  starting-point,  and  there 
is  little  doubt  that  his  publications*  are  the  most  important 
for  the  modem  student  of  psycho- physics.  Much  still  remains 
to  be  done,  however,  and  great  caution  is  needed  in  the  theo- 
retical estimation  of  results.  We  have  already  suggested  that 
the  identification,  or  even  the  approximation  of  graphical 
representations  of  "percentage"  judgments  to  true  frequency- 
distributions  is  of  extremely  doubtful  validity.  This  will 
become  clearer  on  a  consideration  of  the  nature  of  the  latter. 
True  frequency-curves  represent  the  distribution  of  "  random 
samples  "  taken  from  a  "  total  population,"  and  therefore  have 
a  definite  mathematical  basis  in  the  theory  of  probability. 
They  correspond  to  the  results  obtained  in  drawing  balls  from 
a  bag,  or  in  teetotum-spinning,  dice-throwing,  etc.  Cases  of 
this  kind  obviously  occur  in  great  numbers  in  psychology, — 
e.g.  the  distribution  of  a  particular  kind  of  mental  ability  in 

*  In  addition  to  articles  already  quoted,  see  "  Die  psychophysischen  Mass- 
methoden  als  Grundlagen  empirischer  Messungen,"  Archiv  f.  d.  ges.  Psychologic, 
Vols.  XV.  and  xvi. ;  also,  The  Application  of  Statistical  Methods  to  the  Problems 
of  Psychophysics,  Philadelphia,  1908. 
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a  group  of  individuals, — and  hence,  though  not  strictly  relevant 
to  the  subject  of  the  present  chapter,  they  are  relevant  to  the 
general  problem  of  mental  measurement,  and  are  most  con- 
veniently discussed  at  this  point. 

The  general  theory  of  curve-fitting  has  been  worked  out 
in  great  detail  by  Professor  Karl  Pearson*.  A  good  account 
of  his  method  is  given  in  a  book  by  W.  Palin  Elderton, 
Frequency-Curves  and  Correlation,  C.  and  E.  Layton,  London, 
1906,  to  which  the  mathematical  reader  is  referred  for  fairly 
complete  theoretical  and  practical  information.  The  following 
short  sketch  of  the  outline  of  Pearson's  system  will  perhaps  serve 
as  a  reminder  for  readers  already  acquainted  with  the  methods, 
and  in  any  case  will  be  of  some  little  use  in  indicating  the 
general  significance  of  biometric  notation  which  will  be  em- 
ployed later  on  in  this  book. 

Frequency-curves  of  data  not  involving  a  mixture  of  species 
tend  to  commence  at  zero,  rise  to  a  maximum  and  then  fall 
either  at  the  same  or  at  a  different  rate.  There  is  often  high 
contact  at  one  or  both  ends  of  the  distribution.  An  equation 
of  the  general  form 

dy  ^{x-^a)y 
dec  f{x) 

satisfies  these  conditions,  since  if  y  =  0,  ^  =  0  (high  contact) 

and  if  a;  =  -  a,  ^  =  0  (maximum ;  for  a  maximum,  again,  the 

ax 

second  differential  coefficient  must  be  negative).  Expanding 
f  {x)  by  Maclaurin's  theorem,  we  have 

d^^  (x  +  a)y  ^ 

dx      Co  +  CiX+C2X'^  +  C30^+ ..." 

*  Karl  Pearson,  "  Skew  Variation  in  Homogeneous  Material,"  Phil. 
Trans.  Vol.  186  a,  343  £f. ;  "On  the  Systematic  Fitting  of  Curves  to  Observa- 
tions and  Measurements,"  Biometrika,  Vol.  i.  pp.  265 ff.  and  Vol.  ii.  pp.  Iff., 
1901—1903;  "On  the  Curves  which  are  most  suitable  for  describing  the 
frequency  of  Eandom  Samples  of  a  Population,"  Biometrika,  Vol.  v.  1906, 
172—175  (an  exceedingly  clear  summary  of  the  principles  involved). 
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(1)  Putting  ci  =    =  C3  =  . . .  =  0,  we  have 

Idy  _  x+a 
ydx      Co  ' 

which  is  the  Gaussian  or  normal  curve.  It  fits  the  symmetrical 
binomial,  (i  +  if.  ^S-  in  coin-tossing,  where  the  chances  for 
and  against  are  equal  (p  =  q),  and  the  contributory  causes  are 
independent  of  one  another. 

(2)  Putting  C2  =  C3  =  . . .  =  0,  we  have 

Idy  _  x  +  a 
ydx    Co  4-  CiX ' 

which  represents  a  class  of  curves  varying  from  the  Gaussian 
curve  to  the  J-curve.  It  fits  the  asymmetrical  or  "point" 
binomial,  (p  +  q)'^,  e.g.  in  teetotum-spinning  or  dice-throwing, 
where  the  chances  for  and  against  are  not  equal,  p^q,  but  the 
contributory  causes  are  independent  of  one  another. 

(3)  Putting  C3  =  C4  =  . . .  =  0,  we  have 

Idy  _      x  +  a 
ydx    Co  -H  CiX  +  ' 

which  can  be  made  to  represent  almost  all  the  frequency 
distributions  which  may  arise.  It  fits  the  "hypergeometrical 
series,"  the  successive  terms  of  which  give,  e.g.,  the  chances  of 
getting  r,  r  —  1,  ...  0  black  balls  from  a  bag  containing  pn  black 
and  qn  white  balls  when  r  balls  are  drawn*. 

Here  the  contributory  causes  are  not  independent  of  one 
another. 

There  is  no  advantage  in  employing  equations  which  involve 
C3  and  higher  constants,  because  their  use  involves  the  calculation 
of  the  6th  and  higher  "moments,"  and  these  have  very  high 
probable  errors. 

Definition.    The  nth  moment  coefficient  of  any  distri- 

bution about  any  ordinate  is  the  sum  of  the  products  of  the 

*  The  series  is : 

p?t(pn- 1) ...  (pn-r+1)  L         rgn        r  (r- 1)  qn  (qn  - 1)  1 

n(n-l) ...  (n-r+l)     {'^pn-r+l'^     \2     '{pn-r  +  l)  (pn-r  +  2)  •••[ 
Other  series  may  arise. 
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partial-frequencies  and  the  7ith  power  of  the  distances  of  thei 
frequencies  from  the  ordinate,  divided  by  the  total  frequency 


ese 


centroid 
•vertical 


B 


r 


In  symbols,  if  N  be  the  total  frequency. 

The  moments  are,  in  practice,  first  calculated  about  any 
arbitrary  ordinate  that  is  most  convenient,  and  then  reduced  to 
moments  about  the  centroid  in  the  following  way*: 


11  {n  —  \) 
1.2 


This  gives  the  general  reduction  formula 


n(n—l) 
1.2 


'2  ' 

A*  1  ^  n— 2  ■ 


n(n-l)(n-2) 
1.2.3 


which  enables  us  to  transfer  any  moment  from  an  arbitrary 
ordinate  to  the  mean.    Thus  we  have: 

H'2  =  H'\  -  /^V, 

/*3  =  H-'z  -  3a<'V'2  +  2/x'i^ 

/x's  represent  moments  of  the  curve;  but  we  have  to  start  with 
grouped  frequencies,  where  the  frequencies  are  assumed  to  be 
concentrated  along  the  mid-ordinates  of  the  rectangles  (cf. 
Fig.  8,  B).  The  moments  obtained  from  these  grouped  fre- 
quencies are  denoted  by  z/'s  (dashed  and  undashed),  and  coiTec- 
tions  are  necessary.  These  have  been  deduced  by  Sheppardf 
and  are  consequently  known  as  Sheppard's  adjustments. 

*  Dashed  yu's  represent  moments  about  an  arbitrary  ordinate, 

undashed  fx' a       ,,  ,,  ,,     the  centroid  vertical, 

t  W.  F.  Sheppard  :  "On  the  Calculation  of  the  most  Probable  Values  of 
Frequency  Constants,  for  Data  arranged  according  to  Equidistant  Divisions  of 
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They  are :  ^2  =  1^2  —  tV  ' 

/is  =  Vs, 

It  is  generally  said  that  they  are  only  valid  when  there  is 
"  high  contact "  at  the  ends  of  the  frequencies,  but  the  equations 
for  and  fis  are  probably  still  valid,  even  without  high  contact, 
if  the  terminal  frequencies  are  zero. 

[N.B.  In  working,  v"s  are  changed  into  v's  before  applying 
Sheppard's  corrections.] 

It  is  obvious  that  i^o  =  1  and  =  0.  v\  is  the  distance  of 
the  mean  or  centroid  vertical  from  the  arbitrary  ordinate  about 
which  the  moments  are  first  taken,  and  is  conveniently  known 
as  d.    cr  =  V/*2- 

Two  very  important  constants  in  curve-fitting  are 

A  =  ^3>and^2=^:. 

The  values  of  these  are  always  to  be  calculated,  and,  within 
the  limits  of  their  probable  errors  (for  which  see  Rhind's  Tables, 
Biometrika,  Vol.  vii.  1909),  they  fix  the  type  to  which  the 
curve  belongs.  The  general  frequency-curve  equation,  written 
in  terms  of  moments,  is 

/^i     (^2  +  3) 

1   2  5^2-6i3i-9  

ijdx-        (     4fe-3^i  (/32  +  3)      X       2/32 -3ft -6     /A^l  ' 

"  |l0i32-12|3i-18      2    5;82-6i3i-9V     10^2-12|3i--18"  U/  ) 

This  gives  at  once,  for  the  distance  between  the  mean  and 
mode, 

"  =  -2(5A-6A-9)'    (Ongm  IS  at  mean.) 

Hence  the  curve  is  symmetrical  if  ySj  =  0.  If  =  0,  and 
/Sg  =  3,  the  curve  reduces  to  the  Gaussian,  since  the  terms  in- 
volving X  in  the  denominator  of  the  right-hand  side  of  the 
general  frequency-curve  equation  vanish. 

In  using  the  Pearsonian  method,  then,  the  order  of  pro- 
cedure to  be  adopted  is : 

a  Scale,"  Proc.  Lond.  Math.  Soc.  Vol.  xxix.  pp.  353  ff.  Karl  Pearson :  "  On  an 
Elementary  Proof  of  Sheppard's  Formulae  for  correcting  Raw  Moments  and  on 
other  allied  Points,"  Biometrika,  Vol.  iii.  1904,  pp.  308  ff. 
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(1)  calculate  the  moment-coefficients  v\,  v\,  v'.^,  v\  about 
a  convenient  arbitrary  ordinate, 

(2)  transfer  to  the  mean  by  the  equations 


Vi  =  v\  -  4i/'i  v\  -t-  6j/'i^  v\  -  3i/',* 

(y'l,  or  is  the  distance  of  the  mean  from  the  arbitrary 
ordinate), 

(3)  determine  the  corresponding  moments  for  the  curve 
by  the  equations 

t^  —  v^.  —  \ 

fis  =  V3  I    Sheppard's  corrections. 

=  f4  -      +  iTuJ 
[N.B.    For  these  corrections  to  be  applicable,  two  conditions 
must  be  fulfilled : 

(i)  there  must  be  high  contact, 

(ii)  the  grouping  of  the  frequencies  must  be  equal.] 

(4)  Calculate  /3i  and  ^2  by  the  equations 

These  results  give  the  distance  of  the  viean  (v\  or  d)  from 
the  arbitrary  ordinate,  the  standard  deviation  (V/Xs),  and  the 
mode, 

The  investigator  may  then  proceed  to  determine  to  which 
of  the  seven  Pearsonian  "  types  "  the  particular  curve  belongs, 
to  find  its  equation,  and  to  plot  it.  For  instructions  on  these 
points  the  reader  is  referred  to  Palin  Elderton's  book  already 
mentioned. 
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CORRELATION 


INTRODUCTION 


A  somewhat  detailed  account  of  the  mathematical  theory 
of  correlation  and  of  the  way  in  which  it  may  be  usefully 
applied  to  psychological  measurements  will  be  found  in  the 
later  chapters  of  this  Part.  The  object  of  the  following 
introductory  pages  is  to  give  the  reader  a  general  preliminary 
view  of  the  method,  free  from  mathematical  complications,  and 
to  illustrate  it  by  means  of  a  simple  example. 

Correlation  may  be  briefly  defined  as  "tendency  towards 
concomitant  variation,"  and  a  so-called  correlation  coefficient 
(or,  again,  correlation  ratio)  is  simply  a  measure  of  such 
tendency,  more  or  less  adequate  according  to  the  circumstances 
of  the  case.  J.  S.  Mill,  in  his  "  System  of  Logic,"  distinguished 
a  special  scientific  "  Method  of  Concomitant  Variations,"  which 
he  based  upon  the  following  principle : 

"  Whatever  phenomenon  varies  in  any  manner  whenever 
another  phenomenon  varies  in  some  particular  manner,  is 
either  a  cause  or  an  effect  of  that  phenomenon,  or  is  connected 
with  it  through  some  fact  of  causation*." 

The  instances  of  this  principle  which  Mill  had  in  mind  were 
mainly  cases  of  approximately  "  complete "  concomitance  of 
variation,  such  as  those  usually  met  with  in  the  domain  of 
Physics.  In  such  cases,  the  conditions  of  an  experiment  admit 
of  a  high  degree  of  simplification,  the  phenomenon,  or  series  of 

*  Logic,  Bk.  iii.  Ch.  viii.  §  t>. 
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phenomena,  under  investigation  can  be  isolated  with  tolerably 
complete  success,  the  "  irrelevant "  factors  can  be  reduced  to  a 
minimum.  Under  such  conditions,  when  the  degree  of  con- 
comitance of  the  different  corresponding  measures  of  the  two 
phenomena  is  found  to  be  very  high,  the  slight  deviations 
from  complete  correspondence  are  put  down  to  "errors  of 
observation  "  or  other  unavoidable  imperfections  in  the  experi- 
mental method  employed. 

If  the  correspondence  is  one  of  simple  proportionality,  so 
that  the  graphical  representation  of  it  (one  phenomenon  being 
measured  along  the  axis  of  x,  the  other  along  the  axis  of  y) 
is  a  straight  line*,  the  correlation  coefficient,  r,  will  be  =  l. 
Example:  the  variation  of  the  length  of  a  metal  rod  with 
temperature. 

If  the  correspondence,  although  still  approximately  complete, 
is  not  one  of  simple  proportionality,  the  graphical  representation 
of  it  will  be,  not  a  straight  line,  but  a  curve  of  greater  or  less 
degree  of  complexity -f-,  and  the  correlation,  also  complete,  will 
be  measured  not  by  the  correlation  coefficient,  r,  but  by  the 
correlation  ratio,  t).  77  in  this  case  will  be  =  1.  Example :  the 
variation  (inverse)  of  the  volume  of  a  certain  quantity  of  gas  ^ 
with  the  pressure  to  which  it  is  subjected,  the  temperature 
remaining  constant.  A  number  of  pairs  of  values,  Pj,  V^; 
-P2,  1^2;  P3,  V3',  etc.  are  obtained,  and  when  plotted  are  found 
to  give  a  "  scatter  diagram "  of  the  approximate  form  of 
Figure  9. 

In  this  figure,  the  dots  represent  the  individual  pairs  of 
observations,  P,  V.  They  cluster  very  closely  about  the 
hyperbola,  PV=k,  represented  by  the  broken  curve.  The 
curve  is  assumed  to  represent  the  "  real "  or  "  true  "  relation  of 
the  two  "variates"  (as  we  call  such  quantities  as  P,  V),  and 
the  slight  deviations  of  the  observed  values  from  this  curve  are 
explained  as  due  to  errors  of  observation  and  to  other  factors 
irrelevant  to  the  relation  under  investigation.  However  this 
may  be,  the  interesting  point  about  the  figure  so  far  as  our 

*  Hence  the  correlation  is  said  to  be  "linear." 
t  Correlation  said  to  be  "non-linear"  or  "skew." 
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present  purpose  of  explaining  correlation  is  concerned  is  that 
any  definite  observed  P- value  is  "correlated"  with  a  plurality 
or  "array"  of  observed  F-values,  and  that,  similarly,  any 


V. 


:v 


^  p 


Fig.  9. 


definite  observed  F-value  is  correlated  with  a  plurality  of 
P-values.  These  arrays  of  observed  values  cluster  extremely 
closely  about  their  means  (situated  on  the  curve),  i.e.  their 
"scatter"  or  "variability,"  as  measured  by  their  standard 
deviations  (a),  is  extremely  small. 

The  modern  theory  of  correlation  is  directed  towards  the 
manipulation  of  observations  made  upon  phenomena  of  a  much 
greater  degree  of  variability  than  that  found  in  the  case  of 
isolated  physical  phenomena.  The  increased  variability  is  no 
doubt  due,  in  the  main,  to  the  complexity  of  factors  involved. 
The  elementary  factors  do  not  admit  of  isolation,  and  with 
reference  to  the  concomitance  of  variation  of  the  two  series  of 
phenomena  under  consideration  they,  as  it  were,  pull  in  different 
directions.  The  correlation  coefficient  and  correlation  ratio 
measure,  in  these  cases,  the  average  extent  of  the  concomitance. 
As  will  be  explained  more  fully  in  the  next  chapter,  r  can 
only  be  taken  as  a  measure  of  correlation  when  the  average 
relation  between  the  two  variates  is  linear,  and  in  this  case  its 
value  is  identical  with  that  of  rj.    When  the  relation  is  non- 
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linear  r  is  practically  meaningless,  but  v  still  measures  the 
relation  accurately. 

The  general  problem  will  become  clearer  by  reference  to 
the  accompanying  figure. 


mech,  memory. 


Fig.  10. 


Let  US  assume  that  we  have  a  group  of  200  school-children 
and  have  measured  each  of  them  for  mechanical  memory  (x) 
and  for  general  intelligence  (y).  Each  of  the  dots  in  the  figure 
represents  a  child.  Then  if  we  determine  the  mean  y-values 
corresponding  to  each  successive  "group"  of  values,  e.g.  to 
X3,  by  assuming  the  observations  concentrated  on  the  mid- 
ordinate  PM*,  the  line  ABf  drawn  through  these  mean 
y-values  (marked  by  crosses  in  heavy  type)  represents  the  law 
of  change  of  mean  y-value  with  increase  of  x  and  gives  the 

*  The  true  centroid  ordinate  is  slightly  nearer  the  denser  part  of  the  scatter- 
diagram,  here  slightly  towards  the  right  of  FM.  The  correction  is  made  later 
by  means  of  Sheppard's  formulae  (see  above  p.  39,  and  Appendix  II,  p.  137). 

t  I  have  placed  the  means  on  the  straight  line  for  the  sake  of  convenience 
of  exposition.  Actually,  they  will  occur  irregularly  on  either  side  of  it,  and 
AB  will  be  the  "best-fitting"  straight  line,  determined  by  an  application  of  the 
Method  of  Least  Squares.  See  next  chapter  and  also  Fig.  16,  Appendix  II,  p.  140. 
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"  most  probable  "  value  of  y  for  any  particular  value  of  x.  If 
the  line  is  straight  or  approximately  straight  the  "  regression  " 
is  said  to  be  linear,  and  the  equation  to  the  line  is 

y-y  =  r^{x-x), 

where  x,  y  are  the  mean  values  of  all  the  x's  and  y'^  respectively 
{not  the  means  of  the  arrays  just  mentioned), 

o-j,  o-j  are  the  standard  deviations  of  the  a-'s  and  ?/'s  respec- 
tively and  r  is  the  coefficient  of  correlation. 
Thus,  if 

20  is  the  mean  value  of  the  mechanical  memory  of  the  group  (^'^^^ 
30  „   „       „       „      „    „   general  intelligence  „    „       „  (jic^tCt 
4  „   „    standard  deviation  for  mechanical  memory  (o-j) 
7  „   „        „  „        „  general  intelligence  (o-„) 

and,  finally,  "6  is  the  value  of  r;  then  the  most  probable 
measure  of  the  general  intelligence  of  a  child  whose  mechanical 
memory  is  represented  by,  say,  the  value  14,  is  given  by  the 
equation 

2/-30  =  -6x  1(14-20), 
whence  y  =  23'7. 

This  value  is  the  average  of  an  array  of  possible  values,  whose 

standard  deviation   

=  0-2 Vl  - 

=  5;6. 

It  will  be  proved  in  the  next  chapter  that 

8 {xy) ^ 
'  ~  Na,a.,  ' 

where  x  and  y  are  deviations  from  the  mean  (not  absolute  values 
as  assumed  above),  S{  )  indicates  summation,  i.e. 

8  {xy)  =       +  x^y.  +  . . .  +  x^yN, 
and  i\^=the  total  number  of  cases  (children  measured). 

*  First  suggested  by  Bravais ;  shown  to  be  the  best  measure  by  Karl  Pearson, 
who  cave  it  the  name  of  the  "product-moment  formula." 

A  Bravais:  "Analyse  mathematique  sur  les  probabiht6s  des  erreurs  de 
situation  d'un  point,"  Acad,  des  Sciences;  Memoires  presentes  par  d^vers 

savants,  W  Serie,  ix.  1846,  p.  255.  ■  •   »  d^-/   '/•,.^««  Prt» 

Karl  Pearson:  "Regression,  Heredity,  and  Panmixia,"  Phil.  Uans.  Roy. 

Soc.,  Series  A,  Vol.  olxxxvii.  1896,  pp.  25311. 
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It  is  important  to  note  that  by  starting  from  y  instead  of 
from  and  detevmimng  the  means  of  the  ^/-arrays  (such  as  the 
array  within  the  limits  y.^y'.,  y.yW  another  regression  hne,  GD, 
is  obtained  d;iffe^-ent  from  the  first.    Its  equation  is 


x-x  =  r  —  {y-y), 

f  1! 


and  it  represents  the  law  of  change  of  mean  «;-value  with 
increase  of  y.  It  gives  the  "  most  probable  "  value  of  x  for  any 
particular  value  of  y. 

If  the  series  of  means  do  not  lie  on  a  straight  line  (approx.) 
but  on  a  curve  of  greater  or  less  complexity,  the  above  calcula- 
tion is  meaningless.  In  such  a  case,  called  a  case  of  skew 
correlation  and  non-linear  regression,  the  only  measure  of  the 
correlation  of  the  two  variates  is  that  given  by  77,  the  correlation 
ratio.  V  is  the  ratio  of  the  standard  deviation  of  the  means  of 
the  arrays  (S)  to  the  total  standard  deviation  (of  either  the  xs 
or  the  y's).  Thus  there  are  two  values  of  v,  one  for  the  x%  and 
another  for  the  y's.  They  approximate  closely  to  one  another, 
as  a  rule,  so  that  only  one  need  be  calculated. 

7?  =  —,  or  — . 

When  the  regression  is  linear,  7;  =  r ;  otherwise  77  >  r. 
r  ranges  between  the  values  ±  1,  77  between  0  and  1.  7;  is 
always  positive. 

It  will  now  have  become  clear  that  the  correlation  ratio,  77 
(always)  and  the  correlation  coefficient,  r  (when  regression  is 
linear)  are  measures  of  the  tendency  towards  concomitant 
variation  exhibited  by  two  series  of  phenomena,  and  hence 
throw  some  light  upon  the  causal  relations  of  these  phenomena. 
Exactly  what  kind  of  causal  relation  we  are  justified  in  inferring 
from  them  will  become  clearer  in  the  course  of  the  next  few 
chapters. 

We  may  illustrate  the  significance  of  the  idea  of  correlation 
in  a  slightly  different  (and  more  elementary)  way.  Let  us 
suppose  that  the  200  children  have  been  arranged  in  order  of 
merit,  as  regards  mechanical  memory,  on  the  one  hand,  and  as 
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regards  general  intelligence  on  the  other.  If  now  it  were  found 
that  each  child's  order  was  the  same  in  both,  i.e.  that  the  child 
first  in  mechanical  memory  was  first  in  general  intelligence,  the 
child  second  in  mechanical  memory  was  second  in  general 
intelligence,  and  so  on,  the  correspondence  between  the  two 
series  would  be  complete  and  r  would  be  -f  1.  Or  if,  on  a 
second  supposition,  the  child  first  in  the  one  was  last  in  the 
other,  second  in  the  one  was  next  to  last  in  the  other,  and  so 
on,  the  correspondence  between  the  two  series  would  again  be 
complete,  but  inverse,  and  r  would  be  -  1.  Finally,  if  there  is 
no  correspondence  whatever  between  the  two  series,  r  will  be  0. 
A  value  of  r  between  0  and  +  1  will  express  a  tendency,  greater 
or  less  according  to  r's  size,  for  children  above  the  average  or 
mean  position  in  the  one  ability  to  be  above  the  mean  position 
in  the  other,  and  for  children  below  the  mean  position  in  the 
one  to  be  below  the  mean  position  in  the  other.  A  value  of  r 
between  0  and  —  1  will  express  a  tendency,  greater  or  less 
according  as  r  is  numerically  greater  or  less,  for  the  children 
above  the  mean  position  in  the  one  ability  to  be  below  the  mean 
position  in  the  other,  and  conversely.  Now  if  order  or  "  rank  " 
be  taken  as  an  (inverse)  measure  of  ability,  the  value  of 

8{xy) 


or  r 


becomes  l  -  ^^j^2_  i) ' 

where  S  (  )  is  the  symbol  for  summation,  and  d  is  the 
difference  between  the  rank  of  an  individual  in  the  one  series 
and  his  rank  in  the  other.  This  form  gives  us  a  general 
impression  of  its  appropriateness  for  the  purpose  in  view,  since 
the  greater  the  disparity  between  the  two  series  of  ranks  the 
greater  is  8  (d^)  and  hence  the  smaller  is  r.  If  there  is  no 
relation  at  all  between  the  two  series,  S  (d^)  acquires  the  value 
it  Would  have  according  to  pure  chance,  and  this  can  be  shown 
to  be  =  ^  N{N^-1),  which  makes  the  whole  expression  0,  as  it 
should  do. 

The  one  objection  to  the  formula  is  that  it  assumes  the 
difference  between  any  two  neighbouring  ranks  to  be  equal  at 
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all  parts  of  the  scale.  This  is  obviously  a  false  assumption; 
the  distance  of  individual  from  individual  at  the  two  extreme 
ends  of  the  scale  must  be  considerably  greater  than  that 
between  individuals  near  the  middle.  A  correction  for  this, 
based  on  the  assumption  that  the  form  of  distribution  of  the 
abilities  in  each  of  the  cases  is  Gaussian,  has  been  calculated 
by  Prof.  K.  Pearson.    It  is 

where  p  =  i  — 


At  the  end  of  this  chapter  is  given  a  table  whereby  p-values 
may  be  at  once  converted  into  corresponding  ?--values,  according 
to  Pearson's  equation. 

Finally,  there  is  the  question  of  the  "  probable  error  "  (p.E.). 
Like  every  other  constant  calculated  from  a  limited  sample  of 
variable  material,  the  coefficient  of  correlation  varies  in  value 
from  sample  to  sample,  and  a  measure  is  needed  of  the  limits 
within  which  it  may  be  expected  with  a  fair  degree  of  proba- 
bility to  lie.  This  measure  is  given  by  the  "probable  error." 
In  the  case  of  r  determined  by  the  product-moment  formula, 
when  N  is  sufficiently  large, 

•67449,, 

which  means  that  it  is  an  even  chance  that  the  true  value  of 

r    u  +        .1,   r          ,  -67449 
r  lies  between  the  limits  r  +  r^;:   . 

The  chances  are  16  to  1  against  the  value  falling  outside 
the  limits 

r  ±  3  P.E. 

For  r  determined  by  the  rank-formula,  the  probable  error  is 

slightly  larger,  being  -^Qesq-r^) 

vA'' 

If  A^  the  number  of  cases,  be  small  (say,  less  than  30),  the 
probable  error  is  larger.  Its  exact  size  under  such  conditions 
is  not  known. 
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The  following  is  an  example  of  the  way  in  which  a  correla- 
tion coefficient  may  be  obtained  by  means  of  ranks.  The 
subjects  were  boys  in  the  Fourth  Form  of  a  Public  School,  and 
the  correlation  to  be  obtained  is  that  between  ability  in 
Classics  and  ability  in  Drawing. 


Form  Order 


Classics 

Drawing 

R.  C.  0. 

1 

9 

'9)2=  64 

H.  G.  M. 

2 

2 

* 

0 

B.  L. 

9 

16 

49 

F.  L.  S. 

7 

6 

1 

C.  M.  S. 

3 

15 

144 

C.  J.  L.  H. 

5 

4 

1 

A.  L.  P. 

6 

17 

121 

E.  G.  T. 

4 

3 

1 

F  n  F 

8 

5 

9 

N.  P.  K  N. 

11 

14 

9 

H.  B.  D. 

10 

12 

4 

S.  H.  T. 

14 

7 

49 

H.  B.  M. 

12 

1 

121 

L.  H.  S. 

13 

8 

25 

J.  P.  C. 

15 

10 

25 

E.  W. 

16 

18 

4 

C.  C.  M. 

17 

11 

36 

L.  H.  W. 

18 

13 

25 

E.  M.  J. 

19 

19 

0 

iV^==19 

688 =S  (cJ2) 

p  =  l 

6  X  688 
~       19  x360' 

=  •40, 

.  r  = 

=  2sin(|p)  = 

•416, 

P.E.  = 

7063(l-r-)_ 

VF 

•134. 

r  is  here  just  over  three  times  its  probable  eiTor,  and  we 
might  therefore  feel  inclined  to  conclude  that  it  proves  a  real 
correlation  between  the  two  series.  We  must  remember,  how- 
ever, that  19  is  a  very  small  number  of  cases,  and  that  therefore 
the  real  probable  error  is  considerably  larger  than  that  given  by 
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the  formula.  Hence  the  reality  of  the  correlation  is  not  so 
certain.  Our  caution  is  proved  to  be  justified  when  we  turn  to 
the  next  higher  form,  the  Remove,  and  find  that,  with  the  same 
number  of  boys,  the  correlation  between  Classical  ability  and 
Drawing  ability  works  out  as  --313  (±-14),  quite  a  different 
result.  It  might  be  objected  that  other  factors  than  mere 
smallness  in  the  number  of  cases  were  responsible  for  the 
difference ;  e.g.  that  the  tendency  to  specialise  in  Classics  was 
greater  in  the  Remove  than  in  the  Fourth,  and  that  the 
consequent  neglect  of  Drawing  by  the  abler  boys  lowered  the 
correlation.  To  this  it  may  be  replied,  firstly,  that  the  drawing- 
master  was  the  same  for  both  forms,  and  was  likely  to  get  as 
much  out  of  the  boys  as  possible  in  each  case,  and,  secondly, 
that  the  difference  between  the  two  forms  in  respect  of  the 
degree  of  specialising  tendency  was  insufficient  to  account  for 
the  disparity  of  the  results. 

The  correct  way  to  compare  the  results  mathematically  is 
to  determine  the  probable  error  of  their  difference.  This  =  the 
square  root  of  the  sum  of  the  squares  of  the  probable  errors  of 
each,  i.e. 

P.E.ei_6  =  V  P.E.a'  +  P.E.j^ 

which,  in  this  case,         =  VlS^  + 

=  -19. 

The  difference  =  -416  +  -313  =  -73, 
its  probable  error  =  "19, 
nearly  four  times  the  size  of  the  difference. 

A  very  important  extension  of  the  theory  of  correlation  is 
the  conception  of  "  partial "  correlation.  If,  e.g.,  three  mental 
abilities  are  correlated  with  one  another,  it  is  of  interest  to 
know  how  closely  any  two  of  them  are  correlated  with  one 
another  for  a  constant  value  of  the  third.  Such  a  coefficient  is 
written,  in  Yule's  notation,  r,o.3,  and  measures  the  closeness  of 
mterrelation  of  the  first  two  abilities  independent  of  the 
relation  between  them  due  to  their  respectivea-elations  to  the 
third  ability.  ^  '  •    ^.  ' 
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This  may  be  illustrated  from  our  example  by  taking  the 
form-order  for  English  into  consideration  in  addition  to  that  for 
Classics  and  for  Drawing.  The  correlation  between  Classics 
and  English  works  out  as  "78,  that  between  Drawing  and 
English,  as  '21, 

Then  the  correlation  between  Classics  and  Drawing  for 
"  English  constant " 

rcD-rcETDS 

•42  - -78  X -21 
~  \/(l-^78^)(l-^2P) 
=  •42. 

Thus,  in  this  particular  case,  the  "partial"  coefficient  is 
practically  identical  with  the  "  entire  "  coefficient,  showing  that 
the  relation  between  Classics  and  Drawing  is  practically 
independent  of  their  common  relation  to  English.  Calculating 
the  other  two  partial  coefficients,  we  get 

The  only  coefficient  showing  an  appreciable  difference 
between  entire  and  partial  correlation  is  that  between  English 
and  Drawing:  entire  =  "21,  partial  =  '58.  This  result  means 
that  that  part  of  ability  in  English  which  is  not  identical  with 
ability  in  Classics  is  related  more  closely  to  that  part  of  abihty 
in  Drawing  which  is  not  identical  with  ability  in  Classics  than 
is  entire  English  ability  to  entire  Drawing  ability. 

The  result  might  be  illustrated  by  the  intersection  of  three 
spheres  representing  Classical,  Drawing,  and  English  ability 
respectively,  after  the  analogy  of  Euler's  circles  in  Logic,  the 
volume  common  to  any  two  of  the  spheres  being  proportional 
to  the  "  entire  "  correlation  coefficient  for  the  two  abilities,  and 
the  volume  common  to  the  same  two  exckisive  of  any  part  also 
common  to  the  third  sphere  being  proportional  to  the  "partial" 
correlation  coefficient  for  the  two  abilities.  This  cannot,  of 
course,  be  taken  as  any  more  than  an  illustration,  since  the 
exact  significance  of  the  correlation  coefficient,  in  this  sense,  is 
not  known  (see  below  p.  80). 
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The  principle  of  partial  correlation  can  be  extended  to 
include  an  indefinite  number  of  variables,  and  general  formulae 
for  this  purpose  will  be  given  in  the  next  chapter. 

It  will  be  obvious  that  the  same  formulae  may  be  employed 
for  the  elimination  of  "  irrelevant "  factors,  e.g.  age,  sex,  etc. 


Table  for  converting  p  into  r  (r  =  2  sin  -  p)*. 


p 

r 

P 

r 

P 

r 

P 

r 

•05 

•052 

•30 

•313 

•55 

•568 

•80 

•813 

•10 

•105 

•35 

•364 

•60 

•618 

•85 

•861 

•15 

•157 

•40 

•416 

•65 

•668 

•90 

•908 

•20 

•209 

•45 

•467 

•70 

•717 

•95 

•954 

•25 

•261 

•50 

•518 

•75 

•765 

1^00 

1-000 

*  Quoted  from  K.  Pearson,  Drapers'  Company  Research  Memoirs,  Biometric 
Series  iv.  1907,  p.  18. 


CHAPTER  I* 


THE  MATHEMATICAL  THEOEY  OF  CORRELATION 

In  the  present  chapter  an  attempt  will  be  made  to  summarize 
briefly  the  principal  methods  in  use  for  obtaining  a  measure  of 
the  correlation,  or  tendency  towards  concomitant  variation,  of 
two  or  more  variates. 

Let  the  coordinates  of  the  dots  in  the  accompanying  diagram 
— commonly  known  as  a  "  scatter  diagram " — represent  the 
measures  of  two  separate  characteristics,  e.g.  speed  of  adding 
figures  (x)  and  accuracy  of  adding  figures  (y),  in  a  number  of 
individuals  (N). 


O  speed  of  adding  (x) 


Fig.  11. 

liCt  the  crosses  represent  the  mean  values  of  y  corresponding 
to  values  of  x  lying  between  the  limits  of  pairs  of  successive  units 
of  measurement.    Then  the  broken  curve  CO'  passing  through 

*  The  proofs  of  the  formulae  for  r,  tj,  and  the  "two-row table  "  to  be  found 
in  this  chapter  are  abbreviations  of  those  given  by  Professor  Karl  Pearson  in 
hie  lectures. 
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these  crosses  represents  the  most  probable  law  of  relationship 
between  speed  of  adding  and  accuracy  of  adding,  and  is  known 
as  the  regression  curve. 

I.    Correlation  coefficient  (r). 

Let  us  now  find  the  "best  fitting"  straight  line,  LL',  to 
this  curve.  To  do  so  we  apply  the  method  of  least  squares, 
merely  from  motives  of  convenience ;  i.e.  we  choose  a  line  such 
that  the  sum  of  the  squares  of  all  distances  like  PM,  each 
weighted  with  the  number  of  cases  {n^^  from  which  the  mean 
value  of  the  corresponding  array  of  y's  was  determined,  is  a 
minimum. 

Let  ordinate  of  P  be  F  and  ordinate  of  M  be  (the  mean 
of  an  array). 

Then  >S  {wj,  (F—  y^Y]  =  V  (say),  which  is  to  be  a  minimum. 
Let  equation  to  line  LL'  be  F=  Ax  +  B. 

Then  V  =  S{n^{Aw  +  B-%y}   («). 

For  V  a  minimum,  this  gives* 
S(a;y)  _ 

A=JL  _S{x-x)(y-y) 

cr,'  No-,' 
and  also  y  =  Ax  +  B 

{x,  y  being  the  mean  values  of  the  x's  and  y's  respectively,  and 
o-j,  0-2  their  standard  deviations). 

*  V=S {n,, {Ax  +  B~  y^)^}  =A\S  (n^x^)  +B^.S (nj  +  S  {n^y/)  +  2AB.S (n^x) 

-2A.S{n.,xy^)-2B.S(n^y^), 
.-.  ^  =  42      +  52)  +  £2  +  +  2ABX-2A.  ^h^)  _  2By. 

For  this  to  be  a  minimum,  —(-^      ^  <^ 

^fv^  d2/F  '^^^ 

dA 


\n)  'dB\N)  positive. 


(^)  "^"^^          ^®  and 


U(<ri2  +  52)  +  j3j_^i!^)^0   ^^^^ 

[and  B  +  Ax-y  =  0   (2). 

Solving  these  equations  for  A  and  B,  we  have 

S(xy) 
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Therefore  the  line  passes  through  the  means  of  the  two 
values,  and  its  equation  is 

y-y  =  A{x-x). 
Let  us  define  r  as       =  ^(^-^)(3/-^) 

Then 

and  the  equation  to  the  best-fitting  straight  line,  LL\  is 

y-y  =  r'^{x-x)   (^). 

LL'  is  known  as  the  regression  line,  and  r  —  is  called  the 

coefficient  of  regression  of  y  on  a;,  being  the  tangent  of  the  angle 
which  this  line  makes  with  the  axis  of  x. 

Let  y-  F  measure  the  distance  of  any  individual  point  from 
this  line.  Then  the  average  of  the  sum  of  the  squares  of  all 
such  distances 

^  8{y-Yf 

N  '  ' 

_S{y-y  -A  (x  -  x)\-* 
-  ]v 

=  0-2^  +  AW  -  2Ara;a2 
=  a,^l-r-). 

Hence  the  standard  error  (standard  deviation)  made  in 
estimating  the  value  of  y  most  probably  associated  with  any 
particular  value  of  x  from  equation  (/S)  is 

O-aVl-r^  (ry). 

r  is  known  as  the  coefficient  of  correlation,  and  evidently  must 
lie  between  the  values  + 1  and  —  1.  If  the  regression  line 
coincides  with  the  regression  curve,  within  the  limits  of  errors 
of  random  sampling, — in  other  words,  if  the  regression  is  linear 
— r  is  a  measure  of  the  degree  of  dependence  between  x  and  y. 
When  r  =  +  1,  the  points  close  up  upon  the  line  and  the 
"  scatter  diagram  "  contracts  to  become  the  line  itself. 

*  Y  is  the  y  of  equation  (/3),  and  therefore  =  y  +  A  (x-x). 
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If  X  and  y  be  measured  from  their  mean  values,  instead  of 
from  zero  or  some  other  arbitrary  origin,  we  get 

S{xy) 


r  = 


and  the  regression  of  y  on  ai  is  given  by  the  equation 


y  =  r  —  X 


.(8). 


An  analogous  equation,  x  =  r  —  y,  gives  the  regression  of  x  on 

f  2 

y.  There  are  thus  two  regression  lines.  The  coefficients  of 
regression,  r  —  and  /■  — ,  are  conveniently  denoted  by  the  symbols 

6i2  and  &21  respectively. 

If  X  and  y  be  measured  from  their  means  and  also  in  terms 
of  their  respective  standard  deviations  as  unity,  the  regression 
equations  become 

x  =  ry  and  y  =  rx, 

and  r  is  then  itself  the  coefficient  of  regression  of  y  on  a;  and  of 
X  on  y,  the  two  regressions  being  equal. 

II.    Correlation  Ratio  (??)*. 

It  is  clear  that  if  the  regression  is  not  linear  r  ceases  to  be 
a  satisfactory  measure  of  the  relation  between  the  two  characters 
under  consideration.    In  an  extreme  case,  such  as  that  shown 


— \^ 

'•/.*.• 


Pig.  12. 

See  Drapen'  Covipany  Research  Memoirs,  Biometric  Series  11.  pp.  9  et  seq. 
Karl  Pearson  "On  the  Theory  of  Skew-Correlation  and  Non-Linear  Regression." 
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in  the  accompanying  diagram,  r  may  be  2:ero  while  there  is  yet 
a  very  close  relation  between  the  two  characters. 

Let  y  —yx  he  the  distance  of  any  point  of  the  "  scatter 
diagram  "  from  the  regression  curve.  Then  the  average  of  the 
sum  of  the  squares  of  all  such  distances,  each  a;-array  of  t/'s 
being  weighted  with  the  number  of  individuals  {n^  in  it, 

_  <S  [n^  {y  -  y^T] 
N 

and  this  expression,  on  reduction, 

=  <ri  -  2^ 

where  S  =  the  standard  deviation  of  the  means  of  the  arrays, 
each  array  being  weighted  with  the  number  in  it ; 

t 


Let  v 


cr. 


2 


Then 

and  the  mean  square  of  the  distance  of  points  from  the  regres- 
sion curve 

=  (T,^l-V'   (^)- 

Comparing  expressions  (e)  and  (7),  we  see  that  rj  is  &  test  of 
the  linearity  of  the  regression.  If  ?7  =  r,  within  the  limits  of 
errors  of  random  sampling,  the  regression  is  linear. 

Since  v  is  the  ratio  of  two  standard  deviations,  it  nmst 
always  be  positive,  and  from  (e)  we  see  that  it  cannot  be  greater 
than  1. 

Let  Y  be  the  ordinate  of  any  point  on  the  regression  line. 
Then  the  average  of  the  sum  of  the  weighted  squares  of  the 
distances  between  the  regression  line  and  the  regression  curve 

S  [n,  {y,  -  Yf} 

-  N  ' 

which  reduces  to  /y\ 

<r,'{v'-r"')   (0. 
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Thus  Tj  must  always  be  numerically  greater  than  r,  except 
in  the  case  of  linear  regression,  when  it  is  numerically  equal 
to  r. 

In  examining  the  relationship  between  two  measurable 
characters,  rj  should  be  calculated  as  well  as  r,  since  it  serves  as 
a  test  of  the  linearity  or  non-linearity  of  the  regression,  and  is 
also  a  better  measure  of  causal  relation  than  r. 

A  simple  criterion  for  linearity  which  is  very  generally 
applicable  is  that 

For  very  exact  work,  more  complicated  formulae  need  to  be 
employed. 

The  results  obtained  above  are  all  independent  of  the  forms 
of  distribution  of  the  variates. 

In  Appendix  II.  will  be  found  an  example  of  the  evaluation 
of  r  and  of  r)  between  speed  of  adding  single  digits  and  accuracy 
in  doing  so,  the  individuals  measured  being  86  boys  between 
the  ages  of  11  and  12  years  from  two  L.C.C.  elementary  schools. 
The  two  groups  could  be  thrown  together  for  this  purpose,  since 
the  means  and  standard  deviations  calculated  from  them 
separately  were  in  very  close  agreement — well  within  the 
limits  of  the  probable  errors.  The  results  will  be  discussed  in 
Chapter  III. 


III.    Probable  Errors. 

In  determining  means,  standard  deviations,  and  other  fre- 
quency constants,  the  investigator  is  unable  to  work  from  the 
"total  population"  and  must  be  content  with  the  results 
obtained  from  "  random  samples  "  of  greater  or  less  size  taken 
from  this  (in  some  cases,  hypothetical)  total  population.  Evidently 

*  J.  Blakeman  :  Biometrika,  Vol.  rv.  pp.  349,  3^0. 
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the  results  obtained  from  different  random  samples  will  not  as 
a  rule  coincide,  and  some  measure  is  needed  of  the  variability 
of  these  results  about  their  average,  which  latter  quantity,  for 
a  large  number  of  samples,  may  be  regarded  as  coinciding 
approximately  with  the  value  of  the  frequency  constant  for  the 
total  population.  Such  a  measure  is  given  by  the  standard 
deviation,  o-,  of  the  various  results  obtained  from  the  different 


samples,  but,  since  the  theory  of  probable  errors  was  originally 
developed  on  the  assumption  of  the  "  normal  "  or  Gaussian  form 
of  distribution,  the  value  universally  taken  is  "67449  x  cr,  which 
is  conventionally  termed  the  probable  error  (p.e.).  In  the 
normal  curve  (see  Fig.  13)  it  is  the  distance  OP,  the  position  of 
P  being  such  that  the  ordinate  PP'  divides  the  positive  half 
of  the  curve  into  two  equal  areas.  Thus,  if  the  total  area 
of  the  curve  is  taken  as  unity,  the  value  of  p.e.  is  given  by  the 
equation 

la;-! 

/•P.E.      1  -2a^ 

j^=—  e      dx  =  0"25, 
J  0    V  27ro- 

whence  p.e.  =  •67449o-. 

In  the  language  of  chance,  a  deviation  of  +  p.e.  is  as  likely 
to  occur  as  not,  when  the  distribution  of  frequencies  is  "normal." 
The  value  of  a  frequency  constant  determined  from  a  random 
sample  may  be  considered  fairly  i-eliable  within  the  limits  of 
+  (2 — 3  times  its  p.e.). 
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When  the  number  of  cases  (n)  in  the  random  sample  is 
fairly  large— so  large  that  fractions  containing  certain  higher 
powers  of /i  in  the  denominator  can  be  neglected— the  probable 
en-ors  are  found  to  be  as  follows*  : 


cr 

P.E,  of  a  mean  =  '67449  -j=-  > 


.r  = -67449^ 
r= -67449 


1 


The  second  and  third  of  these  values  are  only  correct  when 
the  frequency  distribution  is  normal  or  approximately  normal. 
In  particular,  for  large  values  of  r  the  true  P.E.  may  be  con- 
siderably different  from  that  given  by  the  above  formula  unless 
the  distribution  is  normal. 

P.E.  of  7/  =  -67449  ^^-^^ ,  for  linear  regression,  and  also,  as  a 
vn 

rough  measure,  for  cases  of  skew-correlation.  If  greater  exacti- 
tude is  needed  in  the  latter  cases,  more  complicated  formulae 
have  to  be  employed  f. 

Another  frequency  constant  in  common  use  is  the  coefficient 

of  variation  V,  which  =  , 

mean 

Its  P.E.       =-67449f|i -F2^^y|Y\/2^J. 

As  stated  above,  the  values  just  given  for  the  probable 
errors  only  apply  in  cases  where  n  is  fairly  large.  In  cases 
where  n  is  so  small  that  certain  higher  powers  of  its  reciprocal 
cannot  be  neglected  in  comparison  with  the  rest  of  the  expres- 
sions involving  them,  the  values  cannot  be  used.  For  such 
cases  no  theoretical  formulae  have  hitherto  been  devised. 

*  See  W.  Gibson:  "Tables  for  Facilitating  the  Computation  of  Probable 
Errors,"  Biomctrika,  Vol.  iv.  pp.  385  et  seq. 

t  Karl  Pearson  :  op.  cit.  (Biometric  Series  n.)  p.  19. 

X  Calculated  values  for  different  values  of  n  given  in  Gibson's  Tables,  just 
quoted. 
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Quite  recently,  however,  an  empirical  investigation  has  been 
made  on  samples  of  4,  8,  and  30  cases,  taken  from  a  "total 
population"  of  3000  pairs  of  measurements  (height  and  left 
middle  finger  measurements  of  8000  criminals;  "real"  correlation, 
•66)*.  In  the  case  of  correlation  it  was  found  that  when  the 
original  pairs  of  measurements  were  arranged  so  that  the  "real" 
correlation  was  0,  the  distribution  of  the  values  of  r  obtained 
from  samples  of  n  was  given  approximately  by  the  formula 


n-i 


2/  =  2/o(l  -x')  ^  • 

Five  distributions  in  all  were  worked  out,  giving  the  follow- 
ing moment  coefficients : 


Samples  of  4  {r  =  0) 
8  (r  =  0) 
4(r  =  -66) 
8(r=-66) 
„        30(r  =  -66) 


Mean 


•5609 
•6139 
•661 


S.D. 


•5512 
•3731 
•4680 
■2684 
•1001 


•3038 

•1392 

•2190 

•07202 

•01003 


Ms 


•1570 

•02634 

•000882 


M4 


•1768 

•0454 

•2152 

■02714 

•000461 


^1 


2-245 
1-857 
-7713 


P-2 


1-918 
2.336 

4-  489 

5-  2.32 
4-580 


[Unit  of  grouping  for  4's  and  8's  =  ^04.] 

Correlation  results,  for  real  value  of  r  =  -66,  were 

samples  of  4          -561   ±  "Oil, 

„    8           -614  +  -065, 

„  30           -6609  ±  •0067. 

Hence  it  may  be  concluded  that,  although  in  the  case  of 
such  small  samples  as  4  or  8  the  ordinary  formula  for  the  p.e. 
of  r  gives  much  too  low  a  value,  yet  in  the  case  of  as  many  as 
30,  the  formula  applies  with  tolerable  accuracy.  We  must, 
however,  bear  in  mind  that  this  result  has  only  been  proved 
(empirically)  to  hold  in  the  single  case  where  the  actual 
correlation  was  '66. 

♦  "Student"  :  "The  Probable  Error  of  a  Coefficient  of  Correlation,"  Bio- 
metriha,  Vol.  vi.  PP-  302-310.  1908-1909. 
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IV.    Multiple  Correlation*. 

The  regression  equation  for  the  two  variates,  x  and  y,  both 
measured  from  their  means,  was  found,  on  p.  57,  to  be  of  the  form 

a?  =  &i2y, 

the  equation  being  obtained  by  an  application  of  the  method  of 
least  squares,  on  the  assumption  that  the  regression  was  linear. 

It  is  evident  that  the  method  thus  applied  to  the  case  of 
two  variates  might,  without  further  assumptions,  be  used  in 
the  general  case  of  n  variates.  Thus,  if  denote 
deviations  from  means,  the  equation  expressing  the  regression 
of  could  be  written 

"In  this  notation  the  suffix  of  each  regression  coefficient 
completely  defines  it.  The  first  subscript  gives  the  dependent 
variable,  the  second  the  variable  of  which  the  given  regression 
is  the  coefficient,  and  the  subscripts  after  the  period  show  the 
remaining  independent  variables  which  enter  into  the  equation." 
(Yule,.o;).  ext.  p.  182.) 

In  terms  of  the  same  notation,  write 

^'l2.  34  ...  n  =  (^12.  34 ...  n  •  ^21 .  34 ...   

23  ...  n  =       -  (&12.  34 ...  71  «2  +  ^13.  24  ...  n       +  .  . .  +  ^m.  23  ...  n-\  Xn)"  .  (7), 
^■0-\.^...n  =  S(x\,.^^^^)  (g). 

Then  the  normal  equations,  from  which  the  regressions  are 
determined  by  the  method  of  least  squares,  may  be  written 
S {^2  ■  a-i.  23 ...n)  =  S(x3.     23 ... ,,)=...=  ^      .     ,3... ,J  =  0. .  .(e). 

From  these  equations  Mr  Yule  has  deduced  the  following 
results : 

^I2.34...n  =  na.3,   (y\ 

0-2.34...n 

a^.,„,...„=ar(l-rV,)(l-r^,,,)(l-r^,23)...(l-r\,,,3...,_0.  (^), 


■(0). 


f  12. 34... 11  =  "-1  .  ■■M . . .  71-1  ^'2» .  34 . ■ . n-1 

(l-^^7^.34...n-0*(l-r\,.3,...,_0*' 

•  See  G  Udny  Yule:  "On  the  Theory  of  Correlation  for  any  Number  of 
pp  t  "iWT^^^^^^  a  New  System  of  Notation."  Proc.  Eoy.  LyT^I 
account  '     ^^'"^  '^^^^^^  ^  '"^^'^  condensed 
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^i2.w...n  is  known  as  a  "  partial "  correlation  coefficient,  being  the 
value  of  the  correlation  between  1  and  2  for  constant  values  of 
3,  4  ...  n.  Similarly,  612. 34... n  is  a  "partial"  regression  coefficient 
(cf  "partial  differentiation"  in  the  Differential  Calculus). 
Knowing  the  "  total "  correlations  r,,,  r^,  etc.,  equation  {d) 
enables  us  to  obtain  the  various  partial  coefficients  by  successive 
substitutions.    Thus,  in  the  case  of  3  variables,  1,  2,  3,  we  have 

„        _  ^12  ~  ^13  ^23 

'12.3  —  -    /  , 

and  two  similar  equations,  expressing  the  value  of  the  correla- 
tion between  two  of  the  variables  for  a  constant  value  of  the 
third*.  If  a  fourth  variable  be  added,  we  have  the  further 
set  of  equations 

„      _        ^12. 8  ~  *^4.  3  ^"24. 3 

'  12. 34  1  r  • 

Perhaps  a  more  convenient  formula  for  obtaining  the  partial 
correlation  of  two  variables  for  constant  values  of  a  third  and 
fourth  is 

'  1-1. 34  —  ~,  ^='^=^=-^==    , —  = . 

V 1  -  7-'\s  -  r\i  -  r\i  +  2ri3  r,4  7-34  V 1  -  ?    -     -  7%  +  7 

When  the  partial  correlation  coefficients  have  been  deter- 
mined, the  regressions  can  be  found  by  substituting  in  equations 
(77)  and  (^),  and  these  give  at  once  the  regression  equations. 
As  explained  before,  a  regression  equation  gives  the  most 
probable  value  of  one  variable  for  given  values  of  the  remaining 
variables,  the  standard  error  in  such  a  prediction  being  o'i..)3...,i 
etc. 

An  example  of  the  method  of  applpng  the  above  formulae 
is  given  in  Appendix  III.,  where  the  partial  correlations,  regres- 
sions, etc.  in  the  case  of  four  interrelated  psychical  capacities 
are  worked  out  on  lines  identical  with  those  illustrated  by 
Mr  Yule  in  his  paper. 

*  For  an  interesting  representation  of  correlation  between  three  variables 
by  a  model  showing  the  distribution  of  points  in  space,  see  G.  Udny  Yule,  "An 
Introduction  to  the  Theory  of  Statistics,"  C.  Griffin  and  Co.,  London,  1911, 
pp.  2il-243. 


THE  MATHEMATICAL  THEORY  OF  CORRELATION  65 

The  probable  error  of  a  partial  correlation  coefficient  n^.u-.n 

1  —  '''"l2.34...n 


=  -67449 


n 


I.e.  IS 


similar  in  form  to  that  for  a  "  total "  coefficient. 


V.    Other  methods  for  determining  Correlation. 

1.    Four-fold  Table. 


X2 

a 
• 

b 

a  +  b 

c 

d 

c  +  d 

b  +  d 

(a)  When  the  divisions  pass  through  the  means  of  both 
characters 

.  Tr/a  —  b^ 


r  =  sm  - ,  , 
2  \a  +  oj 

This  formula  (Sheppard's)  is  of  little  practical  use,  since  the 
mean  values,  in  cases  where  the  four-fold  table  is  the  only 
method  which  can  be  used,  are  generally  unknown. 

(6)  In  cases  where  the  dividing  lines  do  not  pass  through 
the  means  (means  not  known),  the  equations  for  determining 
the  correlation  are : 

(a +  6) -(c  +  d)       /2       -hv'  , 
 N  ^y-^]' 


*  All  this  is  rendered  much  easier  by  P.  F.  Everitt's  Tables  of  Tetrachoric 
Functions  just  published  in  Biometrika  (Vol.  vn.  pp.  437-101). 
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from  which  h  and  k  can  be  obtained  by  Sheppard's  Tables ;  and 
finally 

ad  — be  , ,        .,  _ 

+  ^  {h'  -  Qh'  +  3)  (k'  -  6P  +  3) 

+  f^h  {h'  -  iOh'  +  15)k  (k'  -  lot-  +  15) 

+  (h'  -  15h'  +  4>5h'  -  15)  (t  -  15k'  +  4<5k-  -  15) 
+  etc., 

^   e-i'^  and  K=-jLe-^'\ 


where  H-^  ,  ,  ^  

V27r  V27r 

In  obtaining  these  equations,  abnormal  correlation  surface 
is  assumed  ;  equation  : 


z  = 


N 


1  _J_  /        2rxy  \ 


27rcricr2  Vl  — 

The  probable  error  of  r  obtained  by  the  four-fold  table 
method  is  much  larger  than  that  given  by  the  formula  of  p.  61. 
The  correct  formula  is  too  complicated  to  insert  here, 

2.    Method  of  Contingency*. 

The  following  is  an  example  of  a  contingency  table  f. 

Fathers. 


Merry 

Melancholy 

Alternating 

Even 

Totals 

Merry 

122 

8 

81 

67 

278 

Melancholy 

10 

2 

7 

10 

29 

Alternating 

70 

9 

101 

68 

248 

Even 

58 

6 

66 

45 

175 

Totals  ... 

260 

25 

255 

190 

730 

CO 
i3 
O 

CO 


*  Karl  Pearson:  "On  the  Theory  of  Contingency  and  its  Belation  to 
Association  and  Normal  Correlation,"  Drapers'  Company  Research  Memoirs, 
Biometric  Series  i.  1904;  Dulan  and  Co.,  London. 

t  Taken fromapaperbyE.  Schuster andE.  M.Eldertonon  "The  Inheritance 
of  Psychical  Characters  (being  a  further  Statistical  Treatment  of  Material 
Collected  and  Analysed  by  Messrs  G.  Heymans  and  E.  Wiersma),"  Biometrika, 
Vol.  V.  1906-1907,  pp.  460-469. 
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The  method  is  employed  when  the  grouping  is  merely  by 
class  and  the  different  classes  have  no  quantitative  relation  to 
one  'another,— in  other  words,  when  the  grouping  is  merely 
qualitative.  The  relationship  between  the  two  variables  is 
measured  by  the  differences  between  the  numbers  actually 
found  in  the  various  compartments  of  the  table  and  the 
numbers  that  might  be  expected  there  by  pure  chance. 

To  state  the  rule:  ! 

The  total  mean  square  contingency,  <^',  of  the  table  is  given 

by 


9  —^^Pi 


np'tlq 


T 


where 

Up  =  total  frequency  in  pth.  row. 


Vg  =    „         „         „  qth.  column, 
npg  =  frequency  of  constituent  common  to  pth  row  and  qth.  column, 
N  =  total  frequency  of  table. 

Then  the  coefficient  of  mean-square  contingency. 

In  the  above  case  of  fathers  and  sons 

c  =  0-16. 

The  probable  error  of  c  is  very  complicated f. 

•6744.9 

For  c  =  0,  P.E.  =     ,—  . 

3.    Two-row  Table.    (New  method  |.) 

This  method,  recently  devised  by  Professor  K.  Pearson, 
gives  a  unique  value  of  r  in  the  case  of  two  variates  one  of 
which  is  both  quantitative  and  continuous  (e.g.  intelligence) 

*  There  are  certain  corrections  on  not  mentioned  here,  which  often  make 
much  difference  to  the  result. 

t  J.  Blakeman  and  Karl  Pearson  :  "On  the  Probable  Error  of  Mean-Square 
Contingency,"  Biometrihi,  Vol.  v.  pp.  191-197,  1906. 

t  Karl  Pearson  :  Biometriica,  Vol.  vii.  1909,  p.  97, 
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while  the  other,  though  quantitative,  admits  of  only  two 
subdivisions  (e.g.  into  good  and  bad  visualisers),  or,  in  more 
technical  language,  is  "  alternative." 


-*■  X  (intelligence). 


Good 

visualisers 

Bad 

visualisers 

N 

The  assumptions  made  are  two  in  number: 

(1)  that  the  regression  is  linear: 

(2)  that  the  distribution  of  the  alternative  variate  is 
approximately  normal  or  Gaussian. 

The  accompanying  diagram  (Fig.  14)  will  make  the  method 
clearer.    LL'  is  the  regression  line  of  x  on  y,  its  equation  being 

x-x  =  r  —  {y-y), 
x  —  x 

- 

Putting  X,  the  mean  of  all  the  x's-^p,  x  =  p',  the  mean 
value  of  the  array  of  xs  corresponding  to  the  smaller  group 
of  3/'s  (good  visualisers),  and  y-y  =  q,  the  distance  of  the 
centroid  of  the  area  EST  from  the  mean  MM  of  the  Gaussian 
curve,  we  have  the  formula: 


P-P' 
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The  numerator  of  this  is  known,  and  the  denominator 

TOO  i£! 

J  a 


69 


cTa        X  area  of  part  KST 


N  1  • 
nt  V27r 


where 
and 


lit  =  no.  of  cases  in  upper  group  (area  RST), 
N  =  total  no.  of  cases. 


I 

y 

Good 

visualisers 


Bad 

visualisers 


— >x 

L  \ 

<  />'  

T  T 

—  ^  

i  ?  > 

i  1 

\ 

\ 

\ 

\ 

\ 

\ 

\ 

Fig. 

14. 

M 


-iM 


IXt  1 

In  this  expression  ]y^  =  2^^~°'^  Sheppard's  Tables 
{Biometrika,  Vol.  ii.  p.  182)  from  which  the  corresponding 

value  of  —  can  be  read;  and  the  value  of  z  corresponding  to  this 
0-2 

latter  result  gives 

1  A&. 


V27r 


•.  r  = 


0-2/ 

P-P' 


^     /the  value  of  z  in  Sh.'s  Tables  correspondingX  ' 


nt 


to  the  ^  {I -a)  of  ^ 


The  above  method  has  not  yet  been  applied  in  psychology, 
but  might  advantageously  be  used  in  determining  the  relation- 
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ship  of  two  characters,  such  as  those  suggested  (intelligence  and 
visual  imagery),  which  are  both  quantitative,  but  of  which  one 
only  admits  of  reliable  division  into  two  groups.  It  is  the 
smaller  of  these  groups  (the  "  tail "  of  the  distribution)  which 
gives  the  tit  of  the  formula.  In  the  suggested  case,  it  might  be 
represented  by  the  number  of  individuals,  out  of  the  entire 
group  (JSf)  measured,  who  successfully  pass  certain  somewhat 
difficult  tests  of  visual  imagery. 

In  cases  where  the  a;-variate  (continuous  and  quantitative 
in  our  example  above)  can  only  be  divided  into  classes,  showing 
no  definite  order  or  quantitative  relations  to  one  another,  the 
2/-variate  being  again  quantitative  and  assumed  to  follow  a 
Gaussian  distribution,  but  alternative,  a  modification  of  the 
above  method  gives  rj.  Neither  r  nor  the  contingency  method 
is  applicable  in  such  cases. 

4.    Short  Methods*. 

(i)  It  can  be  easily  shown  that 

and  therefore  that 

_       +  cTy-  —  (r'^x-y 
'^^^  -         2a,  ay 

(ii)  If  the  distributions  are  both  normal,  and  if  both 
variates  have  the  same  mean,  m,  and  the  same  standard 
deviation,  a,  then  ■     

_       TT  {S  {m  + x—m  +  y)Y 
N'a"' 

where  S(m  + cc  -  m +  y)  is  the  sum  of  the  positive  differences 
only.    In  Professor  Pearson's  words,  the  rule  is : 

"  Subtract  from  unity  tt  times  the  square  of  the  mean  sura 
of  the  positive  differences  of  paired  variates  divided  by  their 
common  standard  deviation." 

This  method  might  sometimes  be  conveniently  used  m 
determining  the  "individual"  correlation  (see  below  p.  86) 

»  Karl  Pearson:    "On  Further  Methods  of  Determining  Correlation," 
Drapers'  Company  Research  Memoirs,  Biometric  Series  iv.  1907. 
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between  performances  of  the  same  individuals  in  the  same 
mental  tests  on  different  occasions. 

(iii)    The  Method  of  Ranks. 

It  has  recently  been  suggested  by  Dr  C.  Spearman*  that 
measurements  of  psychical  performance  may  convemently-nay 
preferably-be  replaced  by  the  numbers  representing  the  rank 
or  order  of  merit  of  the  individuals  in  the  group.   On  this  basis 
the  ordinary  product-moment  formula  for  r,  ,  can  be  easily 

shown  to  reduce  to  the  form 

QS  (v,  -  v.r   

where  v,  and  v,  are  the  ranks  of  an  individual  in  the  two  series. 

Dr  Spearman  has  suggested  a  still  simpler  formula,  which 
he  calls  a  "  foot-rule  "  formula.    It  is 

where  S(g)  denotes  the  sum  of  the  "gains"  in  rank  (sum  of 
positive  differences)  of  the  second  series  on  the  first,  and 
^(JV2_  1)  is  the  value  of  the  sum  of  such  gains  which  may  be 
expected  by  chance. 

In  order  to  make  the  determination  of  correlation  by  R 
comparable  with  that  obtained  by  the  ordinary  formula— 
presumably,  in  Dr  Spearman's  view,  formula  (a)— he  has 
suggested  the  formula 


r^sinl-^-R)  (7)- 


This  method  of  using  ranks,  and  the  formulae  suggested 
therefor,  were  vigoi-ously  criticised  by  Karl  Pearson  in  the 
paper  quoted  on  the  preceding  page. 

*  0.  Spearman:  "Measurement  of  Association  between  Two  Things,"  Am. 
J.  P.  Vol.  XV.  1904;  "  'Foot-rule'  tor  Measuring  Correlation,"  Brit.  Journ.  of 
Psychology,  Vol.  11.  Pt.  i.  July  1906. 
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Pearsons  Criticisms. 

(a)  The  direct  use  of  ranks,  or  order  in  a  series,  as  a 
quantitative  measure  of  character,  is  inadmissible.  It  assumes 
that  the  unit  of  rank  is  equal  throughout  the  scale.  But  this 
is  very  obviously  not  the  case.  "  Between  mediocrities  the  unit 
of  rank  treated  as  a  measure  of  a  variate  is  practically  zero, 
between  extreme  individuals  it  is  very  large  indeed."  Some 
form  of  frequency  distribution  must  be  assumed,  and  the  method 
of  ranks  assumes  that  form  to  be  a  rectangle.  The  assumption 
made  could  hardly  be  a  more  improbable  one.  On  the 
assumption  of  "  normal "  or  Gaussian  distribution,  Pearson 
shows  that  the  correlation  of  grades  (not  quite  the  same  thing 
as  ranks)  is 


r  =  2sin\^p 


where  /a  =  1  -  '       expression  (a)  above. 

The  grade  of  an  individual  in  a  group  is  measured  by  the 
number  of  individuals  above  him,  represented  by  the  shaded 
portion  of  the  following  normal  curve,  thus  : 


Fig.  15. 


Hence  the  quantities  to  be  correlated  are 


and 
giving 


a;-grade,  gi  =  ^N'+ 


TTCTj  J  0 


N 


1Z2 


1 

a  o-o- 


.-grade,     =      +  '-cZ,, 
^-2  sin  i-  Fl-^^  i^^-^^y]\   (aandS), 
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In  terms  of  the  sum  of  positive  differences  of  ranks  ("gains" 
in  rank)  the  formula  is 

r  =  2cos2,r||i§^|-l 
=  2eos5(l-K)-l   (e). 

o 

This,  then,  is  the  formula  which  should  take  the  place  of 
Spearman's  formula  (7). 

(b)  With  small  values  of  N,  R  retains  the  same  value  for 
wide  variations  in  p  [expression  (a)  above]. 

•43 

(c)  Spearman  obtains  the  value  — =  for  the  probable  eiTor 

of  R  for  zero  correlation,  empirically,  by  noting  the  distribution 
of  a  large  number  of  chance  values  of  S(g)*.  In  the  case  of 
some  of  these,  the  corresponding  values  of  R  are  negative. 
Now  the  negative  values  of  R  only  range  from  0  to  —  '5.  Hence 
the  value  which  he  deduces  for  the  probable  error,  being  that  of 
a  quantity  ranging  from  +  1  to  —  '5,  is  not  comparable  with  the 
probable  error  for  the  product-moment  formula  (range,  + 1 
to  —  1).  The  comparative  accuracy  of  the  different  methods  may 
be  deduced  from  Pearson's  formulae  by  using  differentials,  and 
Pearson  has  proved  that  the  probable  error  of  p  for  zero 
•67449 

correlation  =  .    He  is  thus  able  to  show  that,  for  zero 


correlation. 


V?i- 1 

P.E.  of  r  deduced  from  p  ■■  ^^^^ 


and  P.E.  „  r       „        „   R=  '^^^^ 


V«  - 1  * 

Thus  the  rank  methods  are  less  accurate  than  the  product- 
moment  method.  "  In  particular  it  requires  about  30  "/o  more 
observations  by  the  R  method  to  obtain  r  with  the  same  degree 
of  certainty  when  r  =  0." 

{d)  When  R  works  out  negative,  Spearman  recommends 
that  one  of  the  series  be  reversed.    But  since  R  is  not  a 

*  Drit.  Journ.  of  Psychology,  Vol.  n.  Pt.  i.  1906,  pp.  107,  108. 
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symmetrical  function  of  p,  this  is  not  a  sound  course  to  adopt. 
In  fact,  if  the  numbers  are  small,  reversal  of  one  of  the  series 
may  yet  leave  R  negative.  Pearson's  own  formula  (e)  is  free 
from  this  drawback*. 

An  important  argument  brought  forward  by  Spearman  f  is 
one  to  the  effect  that  absolute  measurements  are  often  hetero- 
geneous and  that  ranking  removes  this  heterogeneity.  There  is 
an  assumption  underlying  this  argument  which  would  seem  to 
be  of  somewhat  doubtful  validity.  All,  or  almost  all,  so-called 
mental  measurement  is  measurement  of  the  object.  It  is 
physical,  not  mental  at  all.  The  physical  objects,  the  physical 
results  can  be  measured — in  terms  of  space  and  time — and  they 
can  be  correlated.  Psychical  measurement  has  perhaps  some 
plausibility  in  the  case  of  sensation  intensities,  in  the  form  of 
the  "  sense-distances "  of  Delboeuf ;  but  this  furnishes  small 
justification  for  the  claims  made  above  on  behalf  of  ranks.  The 
unit  of  rank  means  nothing  until  the  form  of  distribution  is 
fixed,  and  this  can  hardly  be  determined  on  a  purely  psychical 
basis ;  at  least  it  is  difficult  at  present  to  conceive  of  even  an 
attempt  at  such  a  determination. 

Another  important  passage  occurs  on  p.  99  of  the  previously- 
quoted  article,  "  In  this  simplified  form  the  standard  or  r  method 

To  .1  .V,    f       1     1  ^S{v,-v,r 

Spearman  apparently  means  the  tormula  1  -  -\T/]ai_-i\  ' 


shows  itself  to  be  solely  distinguished  from  our  short  or  R 
method  by  the  fact  that  the  differences  of  rank  are  squared. 
The  efifect  of  squaring  is  to  give  more  '  weight '  to  the  extreme 
differences  as  compared  with  the  medium  ones.  This  is 
probably  a  considerable  advantage  in  most  physical  measure- 
ments. But  in  other  fields  of  research,  and  perhaps  above  all 
in  psychology,  these  extreme  cases  are  just  the  ones  of  most 

*  Spearman  endeavours  to  meet  some  of  these  criticisms  in  a  recent  paper; 
British  Journal  of  Psychology,  Vol.  in.  Pt.  3,  Oct.  lUlO. 

t  C.  Spearman:  "'Foot-rule'  for  Measuring  Correlation,"  Bntish  Journal 
of  Psychology,  Vol.  n.  Pt.  i.  July  1906,  p.  93. 
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suspicious  validity  [why  ?]  so  that  squaring  is  here  more  likely 
to  do  harm  than  good." 

Such  an  argument  is  sound  as  applied  to  actual  measure- 
ments, but  as  soon  as  those  measurements  have  been  translated 
into  ranks,  it  loses  its  force.  The  extreme  ranks  are  viore 
reliable  than  the  medium  ones  since  they  are  farther  apart  from 
one  another,  and  therefore  the  extreme  differences  are  more— 
not  less— reliable  than  the  medium  ones. 

Finally,  Spearman  says  that  he  deduced  his  formula 


=  sin  (I 


R 


empirically  from  111  correlations,  the  average  number  of  cases 
in  each  being  21.  Now,  for  ?i=  21  we  have  seen  that  the  P.E. 
is  large,  even  larger  than  that  given  by  the  usual  formula, 

1  2 

•67449  LZJL  .    Hence,  the  chance  that  the  selected  formula 

'J  71 

was  the  best  one  can  not  have  been  gi*eat.  Doubtless  many 
others  would  have  fitted  equally  well ;  and  if  Spearman  wishes 
to  support  his  formula  as  against  Pearson's  on  the  strength  of 
the  fact  that  it  was  deduced  empirically  from  actual  cases  of 
psychical  correlation  and  therefore  was  likely  to  be  best  suited 
to  the  forms  of  frequency  distribution  commonly  followed  by 
psychical  measurements,  such  an  argument  would  be  of  little 
cogency. 

In  all  work  where  accuracy,  and,  above  all,  thoroughness  are 
required,  the  method  of  ranks  should  be  avoided  since  it  gives 
neither  the  mean  nor  the  standard  deviation — quantities  im- 
portant in  themselves  and  valuable  as  a  means  of  obtaining  the 
coefficient  of  variation  of  the  group  in  the  particular  mental 
functions  under  consideration.  Ranks  hide  variations  of  great 
psychological  importance,  and  introduce  a  spurious  homogeneity 
which  can  well  be  dispensed  with.  When  the  number  of  cases 
is  large,  ranking  takes  too  long  a  time,  and  the  grouping  of  the 
individuals  in  a  correlation  table  gives  much  more  information 
as  to  the  nature  of  the  correlation.  Even  when  the  numbers 
worked  with  are  comparatively  small  (say  30 — 100),  means, 
standard  deviations,  and  coefficients  of  variation  are  results  of 
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considerable  intrinsic  value,  and  in  finding  the  standard  devia- 
tion the  deviations  from  the  means  have  to  be  written  down,  so 
that  everything  needful  for  the  calculation  of  r  is  already 
determined,  and  the  evaluation  of  correlation  coefficients 
between  the  various  series  of  measurements  taken  in  pairs  is  an 
operation  quickly  performed.  Another  reason  for  drawing  up 
a  correlation  table  whenever  possible — and  this  of  course  is 
impossible  with  ranks — is  that  only  thereby  can  the  correlation 
ratio,  7],  be  calculated,  a  result  which  tests  the  linearity  or  non- 
linearity  of  the  regression,  and  is  moreover  a  better  measure  of 
causal  relationship  than  r  itself. 

Only  when  n  is  small  (<  30)  are  ranks  to  be  recommended 
in  preference  to  actual  measurements  (since  means,  S.D.'s,  etc; 
are  in  these  cases  of  little  value),  and  then  p  is  as  easily 
calculated  as  i?,  and  more  reliable.  The  results  with  such 
small  numbers  could  do  little  more  than  indicate  the  existence 
of  correlation  in  certain  cases.  They  could  give  practically  no 
information  as  to  its  size  owing  to  the  large  amount  of  probable 
error. 

VI.    Spurious  Correlation. 

Correlation  is  said  to  be  "  spurious  "  when  it  is  due  to  ex- 
traneous conditions  and  does  not  arise  directly  out  of  the  two 
functions  under  consideration.  The  term  is  one  of  relative 
rather  than  absolute  significance,  but  its  appropriateness  will 
become  apparent  after  a  consideration  of  the  follomng  two 
examples : 

1.  Spurious  correlation  arising  from  heterogeneity  of 
material. 

Let  us  suppose  that  two  distinct  groups  of  children  have 
been  measured  for  two  characters  A  and  B,  and  that  the  mean 
abilities,  in  both  A  and  B,  are  higher  in  the  one  group  than  in 
the  other  Then,  even  if  there  is  no  correlation  between  the 
two  characters,  as  estimated  from  each  group,  separately,  a 
positive  correlation  will  be  obtained  from  the  two  groups  taken 
together.  On  the  other  hand,  if  the  mean  ability  in  A  is  higher 
and  the  mean  ability  in  B  lower,  in  the  one  gi'oup  than  in  the 
other  a  negative  correlation  will  be  obtained  by  taking  the  two 
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groups  together.  The  correlation  in  each  case  will  be  due 
simply  to  the  heterogeneity  of  the  material  employed.  The 
difference  in  the  mean  values  for  the  two  groups  must  of  course 
have  some  cause,  such  as  a  difference  of  nationality,  sex,  or 
even  locality  (within  any  one  town  or  district)  from  which  the 
children  are  drawn ;  or,  again,  such  a  cause  as  the  difference  of 
discipline  to  which  the  two  groups  have  been  subjected  in  the 
past.  These  are  extraneous  conditions  and,  if  measurable,  can 
be  allowed  for  by  employing  the  method  of  partial  correlation. 
As  a  rule,  however,  they  are  not  easy  to  determine  quantitatively; 
hence  their  dangerous  character, 
2.    Index  Goi'relation* . 

This  is  a  form  of  spurious  correlation  which  arises  from  the 
use  of  ratios  for  measurements.    Thus  if 

Zl  =  —  ,  , 

and  Xi,Xi,  x^  are  uncorrelated  with  one  another,  it  can  be  shown 
that  the  correlation  between  z-^  and  z^. 


^-3 


a  quantity  which  may  be  as  large  as  '5. 

As  an  illustration  from  psychology,  we  may  mention  the 
correlation  of  .errors  of  observation,  either  of  different  individuals 
or  of  the  same  individual  at  different  times.  Even  if  the 
absolute  errors  are  entirely  uncorrelated,  the  percentage  errors 
(errors  expressed  as  percentages  or  other  fractions  of  the 
observed  quantity)  will  often  show  a  large  amount  of  cor- 
relation. In  such  a  case  as  this,  the  important  question  arises 
whether  we  are,  from  other  considerations,  justified  in  using 
ratios  (or  indices)  rather  than  absolute  measures.  If  we  are, 
the  index  correlation  is  not  to  be  regarded  as  spurious  f. 

*  Karl  Pearson:  "On  a  form  of  Spurious  Correlation  which  may  arise  when 
Indices  are  used  in  the  Measurement  of  Organs,"  Proc.  Roy.  Soc.  Vol.  lx.  1897, 
p.  489. 

t  Spurious  correlation  of  this  (as  of  any)  kind  can  be  eliminated  by  partial 
correlation. 


78 


THE  MATHEMATICAL  THEORY  OF  CORRELATION 


VII.    The  Significance  of  the  Correlation  Coefficient. 

A  correlation  between  two  series  of  measurable  characters 
may  be  regarded  quite  generally  as  due  to  some  sort  and 
degree  of  community  or  identity  of  causation,  and  the  coefficient 
of  correlation,  r,  is  some  sort  of  measure  of  this  identity  of 
causation.  We  may,  then,  quite  legitimately  inquire  what 
numerical  relation  r  bears  to  the  identity.  To  take  an  example 
from  Psychology:  when  we  find  that  the  (partial)  correlation 
between  performance  in  Algebra  and  performance  in  Arithmetic 
is  -75,  does  this  mean  that  75  "/o  of  combined  algebraical  ability 
and  arithmetical  ability  is  common  to  the  two,  the  mental 
operations  being  practically  identical  in  the  two  subjects  for 
this  portion,  or  does  the  -75  measure  the  degree  of  identity  in 
some  more  complicated  way  ? 

The  general  solution  of  the  problem  appears  to  be  impos- 
sible. Its  meaning  may,  however,  become  clearer  after  a 
consideration  of  a  particular  case  in  which  the  significance  of 
the  correlation  coefficient  is  apparent.  The  case  referred  to  is 
an  experiment  in  dice-throwing  devised  by  the  late  Professor 
Weldon*,  and  called  after  him  Weldon's  experiment f.  Let  a 
dozen  dice  be  shaken  up  in  a  box  and  thrown  again  and  again, 
the  number  of  dice  showing  four  or  more  spots  on  the  upper- 
most face  in  each  throw  being  recorded.  Then  if  the  results 
of  the  1st,  3rd,  5th,  etc.  throws,  and  those  of  the  2nd,  4th,  6th, 
etc.,  be  written  down  in  the  form  of  two  series,  the  2nd 
opposite  the  1st,  the  4th  opposite  the  3rd,  etc.,  these  two 
series  will  obviously  show  no  correlation,  since  the  result  of  the 
2nd  throw  is  in  these  circumstances  quite  independent  of  the 
result  of  the  1st,  and  likewise  with  the  4th  and  3rd,  the  6th 
and  5th,  etc.  Let  now  the  second  of  each  of  these  pairs  of  throws 
be  made  dependent  on  the  first  of  each  pair,  in  the  following  way. 
Let  a  certain  proportion  of  the  dice,  say  6,  be  stained  red,  and 
then,  after  throwing  all  the  dice  and  counting  indiscriminately 

*  W.  F.  B.  Weldon :  "  Inheritance  in  Animals  and  Plants  "  in  Lectures  on 
the  Method  of  Science,  edited  by  T.  B.  Strong,  Oxford,  Clarendon  Press,  1906, 
pp.  100-106. 

t  A.  D.  Darbisbire:  "Some  Tables  for  illustrating  Statistical  Correlation," 
Memoirs  and  Proceedings  of  the  Manchester  Literary  and  Philosophical  Society, 
Vol.  LI.  Pt.  in.  1906-1907. 
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all  those  showing  4  or  more  spots  on  the  uppermost  face,  for 
the  second  throw  gather  up  only  the  white  dice,  leaving  the 
red  upon  the  table,  and  throw,  counting  however  both  white 
and  red  dice  when  computing  the  score.  The  number  due  to 
the  red  dice  will  thus  be  common  to  the  two  scores.  Successive 
pairs  of  throws  are  then  made  in  the  same  way.  If  now  the 
two  series  of  values  be  compared,  they  will  be  found  to  be 
correlated  together ;  a  result  to  be  expected,  since  the  score  of 
the  second  throw  of  each  pair  was  to  a  certain  extent  dependent 
on  that  of  the  first.  The  actual  value  of  the  correlation 
coefficient  will  be  found  to  be,  in  the  case  taken,  or  "5. 
Quite  generally,  if  n  is  the  total  number  of  dice  thrown,  and  I 
the  number  left  on  the  table  after  the  first  of  each  pair  of 
throws, 

I 


r  = 

n 


This  result  admits  of  a  theoretical  proof  as  follows : 
Let  the  successive  scores  of  the  pairs  of  throws  be 


and  let 


fll  +  iCi 

Cii  +  6a 

tta  +  Co 

Cg  +  62 

as  +  Cs 

Cs  +  h 

Cjsr+by 

x  =  a  +  c, 

2/  =  &  +  c. 

Then  rv„  =  ^i^^L^t^^ 


_S{x-yy    S{a-by    S{a')    S  (b^ 

-  ^  _  ^-  +  =  + ^^^^ 
^.=^§(ttir_sm  Sic')  , 


Let  I  =  no.  of  dice  left  on  table  1 
»  m=  „    „    „    thrown  twice  | '  ^  +  = 
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Then,  since  the  distribution  of  results  in  the  throwing  of  n 
dice  N  times  is  given  by  the  successive  terms  of  the  binomial 
expansion  of 

  ^(p  +  q)"", 

of  which  the  S.D.  =  'Jnpq  (putting  N  equal  to  unity), 

.-.    (r.^  =  lpq,        a{-  =  mpq, 

,  I  I 

and  r^y  =  y- —  =  - . 

l  +  m  n 

In  the  case  of  Weldon's  experiment,  then,  the  proportion  of 
factors  common  to  the  two  series  is  given  by  the  correlation 
coefficient  itself,  but  in  the  cases  ordinarily  met  with  in  biology 
and  psychology  the  relations  of  the  factors  at  work  must  be 
much  more  complex.  The  extent  of  identity  of  causation  is 
measured  by  (f>  (r),  some  function  of  r,  but  the  form  of  this 
function  is  unknown  to  us.  It  is,  for  example,  an  unwarranted 
assumption  to  say,  as  Spearman  does,  that  the  square  of  the 
correlation  coefficient  measures  "  the  relative  influence  of  the 
factors  in  A  tending  towards  any  observed  correspondence 
(with  another  variable  B)  as  compared  with  the  remaining 
components  of  A  tending  in  other  directions*." 

*  C.  Spearman :  "  The  Proof  and  Measurement  of  Association  between  Two 
Things,"  American  Journal  of  Psychology,  Vol.  xv.  Jan.  1904,  p.  75. 
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The  histoiy  of  the  use  of  the  theory  of  correlation  in 
Psychology  can  hardly  be  said  to  have  begun  earlier  than  the 
commencement  of  the  present  century.  During  the  previous 
twenty  years,  indeed,  a  great  deal  of  work  had  been  done  by 
many  observers  in  measuring  simple  mental  abilities  (by  the 
"mental  test"  method)  in  larger  or  smaller  groups  of  subjects, 
and  attempts  had  even  been  made  to  determine  in  what  way 
these  abilities  were  related  to  one  another  and  to  more  general 
mental  ability,  or  "  general  intelligence." 

Owing,  however,  to  a  universal  lack  of  knowledge  of  the 
mathematical  theory  of  correlation  among  psychologists  during 
this  period,  the  results  were  not  obtained  in  a  form  suitable  for 
comparison  with  one  another,  so  that  it  is  not  surprising  to 
find  that  they  hopelessly  contradict  one  another.  The  hetero- 
geneity of  the  material  worked  with,  the  non-elimination  of 
irrelevant  factors,  and  the  absence  of  any  measure  of  the 
"probable  en-or"  of  the  results  make  the  conclusions  drawn  by 
the  investigators  themselves  from  their  researches  utterly 
unreliable. 

The  first  investigation  showing  any  mathematical  precision 
was  that  published  by  Clark  Wissler*  in  1901.  It  contained 
(inter  alia)  an  account  of  the  careful  application  of  a  large 
number  of  simple  mental  tests  upon  over  200  college  students, 
and  a  correlation  of  the  results  with  one  another  and  with  the 
students'  marks  in  the  various  subjects  of  the  college  curri- 
culum. The  mental  tests  were  found  to  correlate  but  slightly 
with  one  another  or  with  ability  in  college  subjects  of  study, 

*  Clark  Wissler:  "The  Correlation  of  Mental  and  Physical  Tests," 
Fsychological  Review,  Monograph  Supplement,  Vol.  ni.  No  16  June  1901 
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though  these  latter  showed  considerable  correlation  with  one 
another  (-80— -75). 

In  the  following  year  Aikins  and  Thomdike*  published 
results  which  were  in  a  sense  confirmatory  of  those  of  Wissler, 
since,  notwithstanding  the  greater  similarity  to  one  another 
of  the  functions  investigated  than  in  Wissler's  research,  the 
correlations  were  again  found  to  be  low.  For  example,  different 
tests  devised  for  the  measurement  of  "  speed  of  association " 
were  found  to  show  hardly  any  correlation — a  result  which 
seems  to  furnish  some  justification  for  the  authors'  statement 
that  "quickness  of  association  as  an  ability  determining  the 
speed  of  all  one's  associations  is  a  myth"  {op.  cit.  p.  375). 
A  similar  lack  of  close  relationship  was  found  in  the  case  of 
other  mental  functions  which  would,  on  the  evidence  of  intro- 
spection alone,  be  confidently  classed  as  particular  instances  of 
the  same  general  mental  function. 

In  C.  Spearman's-f-  researches,  published  in  1904  and  1906, 
a  change  in  tactics  appears.  Instead  of  measuring  large 
numbers  of  individuals,  as  his  predecessors  had  done,  Spearman 
contents  himself  with  small  numbers,  groups  of  less  than  40  in 
his  first  research,  and  as  few  as  11, — in  the  case  of  one  of  the 
tests,  7 — in  the  second  ;  but  he  proposes  to  make  up  for  the 
unreliability  thus  introduced  into  the  results  by  a  more  careful 
measurement  of  his  cases,  and  the  application  of  "  corrections  " 
to  the  "  raw  "  values  of  his  correlation  coefficients  by  means  of 
appropriate  mathematical  formulae. 

The  "  corrections  "  are  of  two  kinds.  One  is  that  for  the 
"  elimination  of  irrelevant  factors  "  and  is  nothing  new  in  the 
theory  of  correlation,  being  simply  the  method  of  "  partial " 
correlation  (see  p.  63  above). 

For  example :  To  eliminate  effects  of  difference  of  age  in 
the  group  experimented  on  one  would  determine  the  "  partial " 

*  "Correlations  in  the  Perceptive  and  Associative  Processes,"  Psychological 
Review,  Vol.  ix. 

t  C.  Spearman:  "'General  Intelligence'  Objectively  Determined  and 
Measured,"  American  Journal  of  Psychology,  Vol.  xv.  1904,  pp.  201-292. 

F.  Krueger  and  C.  Spearman:  "Die  Korrelation  zwischeu  verschiedeneu 
geistigen  Leistungsfahigkeiten,"  Zeitschrift  fiir  Psychologic,  Bd.  xliv.  1906. 
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correlation  between  the  two  characters  under  consideration,  for 
"  age  constant."  Thus,  in  Yule's  notation,  if  1  and  2  represent 
the  characters  to  be  compared  and  3  represents  "  age," 

'"12  ~  ^'13  ^23 


''12.  3  


V(l-r^,)(l-r'-^,3) 
A  similar  procedure  is  needed  for  eliminating  the  effects  of 
differences  of  sex,  etc.  A  preferable  course,  however,  would  be 
to  dispense  as  far  as  possible  with  the  necessity  for  such 
correcting  by  selecting  groups  of  individuals  of  the  same  age, 
sex,  etc. 

The  other  method  of  correction  advocated  is  that  for  the 
"elimination  of  observational  errors."  Obviously,  errors  of 
observation  must  make  any  correlation,  worked  from  measure- 
ments containing  them,  different  from  (generally,  though  not 
universally,  less  than)  the  true  value  of  the  coi-relation.  The 
formula,  in  its  full  form,  which  Spearman  has  proposed  as  a 
means  of  correcting  for  this,  and  of  which  he  has  given  a  proof 
in  the  Am.  J.  P.*,  is  as  follows: 

a/a*  ^  7^ 


r. 


xy 


where  x,  y  are  the  true  values  to  be  correlated,  and  x^,  x„,  y^,  ?/, 
are  two  pairs  of  obtained  values. 

^a^ix,.  and  similar  coefficients,  are  called  by  him  "  reliability 
coefficients."  They  represent  the  correlation  of  two  distinct 
series  of  measurements  of  the  same  mental  capacity. 

The  proof  of  Spearman's  formula  is  only  valid  on  the 
assumption  that  the  errors  of  measurement  are  uncorrelated 
with  each  other  or  with  x  or  y\.  Thus,  if  x,  y,  5,  e  represent 
deviations  from  means,  and 

x^  =  x  +  hu       x^  =  x  +  h., 

y^  =  y  +  €i,     2/2  =  2/ +  62, 

*  C.  Spearman:  "Demonstration  of  Formulae  for  True  Measurement  of 
Correlation,"  American  Journal  of  Psycholooy,  Vol.  xvii.  1906. 

t  See  Mr  G.  Udny  Yule's  short  proof  of  the  formula,  quoted  in  my  pamphlet 
on  "Some  Experimental  Results  in  Correlation,"  Comptcs  Rcndus  dn  VP"' 
Congrls  International  de  Psychologie,  Gen6ve,  Aoftt  1909. 
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then  S(xB),  etc.  =  0,       S  (S,S,),  etc.  =  0  ; 

also  S  {x,y,)  =  S  (xy)  =  S  {x,y,). 

[All  this  is  involved  in  Mr  Yule's  proof.] 

Now,  these  are  very  large  assumptions  to  make.  Even  in 
cases  where  the  quantities  B,  e  are  genuine  errors  of  measure- 
ment, there  are  strong  reasons  for  assuming  (on  general 
principles  and  also  from  experimental  evidence)*  that  they  will 
be  correlated.  But  in  the  case  of  almost  all  the  simpler  mental 
tests  the  quantities  8  and  e  are  not  errors  of  measurement  at 
all.  They  are  the  deviations  of  the  particular  performances 
from  the  hypothetical  average  performance  of  the  several 
individuals  under  consideration.  Thus  they  represent  the 
variability  of  performance  of  function  within  the  individual. 
When  an  individual  in  the  course  of  three  minutes  succeeds  in 
striking  through  100  es  and  r's  in  a  page  of  print  on  one  day, 
and  94  under  the  same  conditions  a  fortnight  later,  there  is  no 
error  of  observation  involved.  The  numbers  100  and  94  are 
the  actual  true  measures  of  ability  on  the  two  occasions.  The 
average  or  mean  ability,  which  is  the  more  interesting  measure 
for  the  purposes  of  correlation,  is  doubtless  different  from 
either,  but  that  does  not  make  the  other  two  measures  erroneous. 
Evidently  in  these  cases  S  and  e  represent  individual  variability, 
and  to  assume  them  uncorrelated  with  one  another  or  with  the 
mean  values  of  the  functions  is  to  indulge  in  somewhat  d  ^rtori 
reasoning. 

There  are  two  comparatively  simple  ways  of  testing  the 
assumption : 

( 1 )    S  (x,y,)  =  S  (xy)  =  S  (x^,), 

.'.  S (x^yi)  —  S (x^yi)  should  =0  within  the  limits  of  the 
probable  error  of  the  difference. 

I  have  applied  this  test  to  the  case  of  correlation  between 
accuracy  in  bisecting  lines  and  accuracy  in  trisecting  them  in 
43  adult  subjects. 

*  See  Karl  Pearson  :  "  On  the  Mathematical  Theory  of  Errors  of  Judgment, 
with  special  reference  to  the  Personal  Equation,"  Phil.  Trans.  A,  Vol.  198, 
pp.  235-299. 
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Here        S  {b,t^)  -  B  {h,t,)  =  137780  -  60036 

=  77744, 


RE.  of  >Sf  {xy)  =  -67449  ^P^Hl^ , 


in  Pearson's  notation. 


•67449      8{xyf  S(w')S(y') 


p.E.  of  S  {h,U)  =  687  ;       p.e.  of  S  {h,Q  =  365  ; 


.  • .  p.E.  of  S  (h,U)  -  S  (hA)  =  V687^  +  365^ 

=  778. 

Since  778  is  less  than  one-third  of  77744,  the  formula 
cannot  be  employed  to  obtain  the  correlation  between  mean 
abilities  in  bisecting  and  trisecting  lines. 

C2)    r  >Sf{(a;i-a;,)(3/i-ya)} 
Y\-t  ^S(x,-x,y.S{y,-y,r 


=  0 

if  errors  are  uncorrelated  with  one  another  (since  numerator 
then  =0). 

Applying  this  test  to  the  same  case  of  bisection  and  tri- 
section,  I  get 

ra,-s,  =  0-30±009, 

which  proves  once  more  the  inapplicability  of  the  formula. 

I  applied  test  (2)  also  to  the  case  of  correlation  between 
speed  of  addition  of  figures  and  accuracy  of  addition  in  a  group 
of  38  school  children  (girls  between  the  ages  of  11  and  12, 
Group  II  below)  and  found 

^•-s,-^,  =  0-35  ±0-09. 

Even  when  test  (2)  does  give  the  value  0,  we  can  only 
conclude  from  this  that 

S  (a,e,)  +  S  (S,6,)  =  S  (8,6,)  +  S  (S,6,). 
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We  cannot  conclude  that  the  formula  is  applicable,  unless 
we  have  further  independent  evidence*. 

For  the  reasons  presented  above,  I  should  prefer  to  avoid 
the  use  of  Spearman's  formula— by  increasing  the  number  of 
the  original  measurements  of  each  abilityf— and  would  also 
suggest  that  his  so-called  "reliability"  coefficients  might  in 
some  cases  be  more  appropriately  termed  "  coefficients  of  indi- 
'vidual  correlation,"  since  they  are  more  analogous  to  Karl 
Pearson's  "correlation  of  undifferentiated  like  parts"  than  to 
anything  elsej. 

The  above  discussion  raises  the  interesting  question  as  to 
the  relation  between  ability  and  variability,  and  the  correlation 
coefficient  between  mean  ability  and  the  standard  deviation 
would  be  the  best  measure  of  this  relation.  When  the  measure- 
ments have  been  made  on  only  two  separate  occasions,  the 
expression  r^.  +  x,  might  be  regarded  as  a  rough  measure  of  the 

relation,  and  I  would  suggest  that  it  be  called  a  "variability 
coefficient " — not  to  be  confused  with  the  "  coefficient  of  varia- 
tion," which  =  .    If       and  chance  values,  and  if 

mean  ' 

the  distribution  of  abilities  at  the  given  task  within  one  and 
the  same  individual  is  approximately  normal,  then 

'^x^=  Ifo-  (approx.)  §, 
so  that  there  is  sufficient  justification  for  this  value. 

To  return  to  Spearman's  researches.  In  his  first  series  of 
investigations  J.  P.  Vol.  xv.)  the  following  groups  of 

subjects  were  measured:  24  village  school  children  of  both  sexes, 

*  In  a  recent  paper  (British  Journal  of  Psychology,  Vol.  iii.  Oct.  1910, 
pp.  271  ff.)  Spearman  admits  the  difficulties  attending  the  use  of  his  "correction 
formula,"  and  brings  forward  another  formula  to  correct  for  "accidental"  as 
distinct  from  "constant"  errors.  Since  no  experimental  evidence  is  given  as 
to  the  validity  of  the  assumptions  which  this  new  formula  involves,  I  merely 
content  myself  with  mentioning  the  fact  of  its  publication  here. 

t  This  plan  would  also  have  the  advantage  of  keeping  the  probable  error 
low.  Correction  by  Spearman's  formula  while  raising  the  value  of  the  coefBcieut 
raises  the  size  of  the  p.k.  in  the  same  proportion. 

+  Grammar  of  Science,  2nd  Ed.,  pp.  393,  397. 

§  Karl  Pearson  :  "Francis  Gallon's  Difference  Problem,"  Biometrika,  Vol.  i. 
p.  399. 
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age  limits  10  0— 13-10 ;  23  boys  of  a  high  class  preparatory 
school,  age  limits  9-5- 13-7  ;  and  27  adults  of  both  sexes,  age 
limits  21—78.  The  tests  employed  were  those  for  pitch  discri- 
mination, weight  discrimination  and  discrimination  of  light 
intensities ;  and  measures  of  intelligence  were  obtained,  in  the 
case  of  the  children,  from  results  of  school  examinations,  grading 
by  teachers,  and  grading  of  one  another  by  the  children  them- 
selves (measure  of  common  sense).  Only  in  the  cases  of 
"common  sense"  (village  children)  and  "final  school  order" 
were  reliability  coefficients  calculated.  Consequently  the  reli- 
ability coefficient  for  common  sense  ('64)  was  used  to  bring  up 
the  various  sensory  discrimination  coefficients  to  their  true 
values.  Corrections  were  made  for  age  and  sex,  and  the  final 
results  were  then  found  to  be  comparatively  high  in  the  village 
school  group. 

In  the  preparatory  school  group  the  highest  raw  coefficient 
was  0-32  +  O'OS,  that  between  weight  discrimination  and  light 
discrimination ;  in  the  group  of  adults  that  between  pitch  discri- 
mination and  light  discrimination  was  the  highest,  being  likewise 
0"32  +  0'08.  The  various  school  subjects  in  the  preparatory 
school  (Classics,  French,  English,  Mathematics)  were  found  to 
correlate  highly  with  one  another,  and  when,  with  the  inclusion 
of  pitch  discrimination  and  music,  they  were  arranged  in  rows 
and  columns  in  order,  from  left  to  right  and  from  above  down- 
wards, of  the  average  degree  of  the  correlation  of  each  with  all 
the  rest,  they  were  found  to  form  a  hierarchy  such  that  every 
coefficient  was  greater  than  any  to  the  right  of  it  (in  the  same 
row)  or  below  it  (in  the  same  column).  The  order  was  Classics, 
French,  English,  Mathematics,  Pitch  Discrimination,  Music. 
Spearman  takes  this  to  be  one  form  of  evidence  for  the  unity 
of  the  intellective  function,  defined  as  follows : — "  Wherever 
branches  of  intellectual  activity  are  at  all  dissimilar,  then  their 
correlations  with  one  another  appear  wholly  due  to  their  being 
all  variously  saturated  with  some  common  fundamental  function 
(or  group  of  functions)*."  He  considers  that  by  the  use  of  his 
formula  for  the  correction  of  observational  errors  he  can  demon- 
strate the  same  thing  (the  existence  of  a  "  central  function ") 
*  Am.  J.  P.  Vol.  XV.  op.  cil.  p.  273. 
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and  can  in  particular  show  "  that  the  common  and  essential 
element  in  the  intelligences  wholly  coincides  with  the  common 
and  essential  element  in  the  sensory  functions*."  The  method 
of  proof  is  as  follows  : 

Let  a?!,  be  two  distinct  measures  of  sensory  discrimination, 
and  2/i,  2/2  two  distinct  measures  of  intelligence. 

Then  the  correlation  of  the  function  common  to  the  functions 
measured  by  x^,  x^  with  the  function  common  to  the  functions 
measured  by  2/1,  3/2  is  equal  to 

'^^•gi^/i  ^a;,iyi  ^argy,  ^g^?/; 

and  if  the  two  functions  referred  to  are  identical  this  expression 
should  be  equal  to  unity. 

In  the  present  article,  Spearman  uses  a  simplified  formula. 


fxy 


and  puts  for  the  numerator  the  average  of  the  various  correla- 
tions evaluated  between  the  intelligences  and  the  discrimina- 
tions, and  in  the  denominator  puts 

''a!,a;,  =  the  average  correlation  of  the  intellective  gradings 

with  one  another, 
r.y^y^  =  the  average  correlation  of  the  gradings  in  discri- 
mination with  one  another, 
and  in  this  way  gets  results  approximately  equal  to  1  in  the 
different  groups  tested. 

One  or  two  remarks  may  appropriately  be  made  here.  In 
the  first  place,  the  full  formula  is  the  only  one  that  can  be  used 
with  any  meaning  or  justice  since  it  is  the  only  one  which  issues 
logically  from  the  mathematical  proof  In  the  second  place, 
the  applicability  of  the  true  formula  must  be  considered  in  the 
light  of  its  presuppositions  (mentioned  above,  p.  83). 

S  should  =  8  (Xi'i/i)  within  the  limits  of  probable  error, 
and  »'.r,-a;j  should  be  zero.    Indeed,  the  assumption  that  S  and  e 

are  uncorrelated  with  each  other  or  with  x  or  y  seems  even 

*  Op.  cit.  p.  269. 
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more  unwarrantable  here  than  in  the  case  of  "correcting 
coefficients,  for  which  the  formula  was  originally  devised. 

In  Spearman's  second  research*,  the  subjects  experimented 
on  were  eleven  adults,  and  the  measurements  made  were : 

(1)  the  spatial  threshold, 

(2)  addition  of  figures, 

(3)  memory  of  the  order  of  a  series  of  figures  previously  seen, 

(4)  combination  method  of  Ebbinghaus  (only  7  subjects 

took  this  test),  and 

(5)  discrimination  of  pitch. 

After  corrections  for  disturbing  but  irrelevant  factors  it  was 
found  that  (1)  and  (3)  showed  no  correlation  with  each  other  or 
with  (2),  (4)  and  (5),  but  that  these  latter  correlated  highly 
with  one  another  =  '70,  r,,  =  -68,  =  -64),  and  the  coefficients 
could  be  arranged  to  show  a  hierarchical  order.  The  raw  values 
were  found  to  give  evidence  of  a  "central  factor"  when  tested  by 
Spearman's  correction  formula,  though  it  is  true  that  the 
arithmetic  mean  instead  of  the  geometric  mean  was  evaluated 
in  the  numerator  (an  entirely  unjustifiable  procedure),  thus 
evading  the  difficulty  that  would  have  otherwise  arisen  owing 
to  the  presence  in  it  of  a  zei'O  value f.  After  working  up  some 
further  data,  taken  from  Oehrn's  writings  (number  of  subjects 
employed,  10),  and  finding  the  results  in  agreement  with  the 
hypothesis  of  a  "central  factor,"  and  finding  too  that  the  correla- 
tion coefficients  for  the  same  pair  of  abilities  increase  with  increase 
of  practice  and  thus  contradict  the  view  of  BinetJ  that  correlation 
in  mental  tests  is  the  result  of  lack  of  practice,  and  diminishes 
in  size  with  increase  of  practice.  Spearman  concludes  his  paper 
with  the  suggestion  (made  "with  the  greatest  reserve  ")  that  the 
central  factor  is  a  plastic  function — "  plastische  Funktion  " — of 
the  nervous  system  which  varies  from  individual  to  individual 
and  exists  as  a  determining  influence  for  all  the  various  psycho- 
physiological functions  of  the  individual.  Whatever  it  is,  it  is 
not  voluntary  attention  (p.  106),  as  Binet  believed  it  to  be§. 

*  Krncger  and  Spearman  :  Zeitschr.  f.  Psych.  Bd.  xmv.,  op.  cit. 
t  Op.  cit.  p.  85.  +  L'Aunee  Pmjc.holooiquc,  vi.  p.  395,  1899. 

§  It  is  perhaps  hardly  necessary  to  point  out  that  the  detailed  consideration 
of  Dr  Spearman's  work  given  here  and  elsewhere  in  the  present  book  is,  and  is 
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Spearman's  theory  of  an  identical  or  absolute  correspondence 
between  "general  discrimination"  and  "general  intelligence" 
has  been  recently  put  to  the  test  by  Messrs  Thorndike,  Lay  and 
Dean*.  The  subjects  examined  were  37  young  women  students 
and  25  high  school  boys.  The  tests  for  sensory  discrimination 
were : 

(1)  drawing  lines  equal  to  given  lines,  and 

(2)  filling  boxes  with  shot  to  equal  in  weight  standard 

weights ; 
those  for  intelligence  were : 

(3)  judgment  of  fellow-students,  and 

(4)  judgment  of  teachers. 

For  the  high  school  boys  (3)  and  (4)  were  combined  teachers' 
and  fellow-students'  judgments,  and  school  marks,  respectively. 

In  the  first  case.  Spearman's  formula  gave  for  the  correla- 
tion of  the  factor  common  to  (1)  and  (2)  with  that  common  to 
(3)  and  (4)  the  value  0-26  instead  of  1-00.  In  the  second  case, 
the  value  was  0-29.  Moreover,  Thorndike  found  a  much  higher 
correlation  between  discrimination  of  lengths  and  discrimination 
of  weights  than  between  either  one  of  them  and  general  intelli- 
gence, the  coefficients  being 

Accuracy  in  drawing  lines,  intelligence    0-15, 

Accuracy  in  making  up  weights,  intelligence  ...  0*25, 
Accuracy  in  drawing  lines  and  making  up  weights  0"50. 
Thus  the  results  were  in  decided  conflict  with  both  parts  of 
Spearman's  concluding  statement  "  that  all  branches  of  in- 
tellectual activity  have  in  common  one  fundamental  function 
(or  group  of  functions)  whereas  the  remaining  or  specific 
elements  of  the  activity  seem  in  every  case  to  be  wholly 
different  from  that  in  all  the  others  f." 

meant  to  be,  a  tribute  to  the  outstanding  importance  of  that  work.  No  one 
can  appreciate  more  vividly  than  does  the  writer  the  valuable  suggestiveness  of 
all  Dr  Spearman's  writings,  above  all  in  respect  to  the  special  precautious 
therein  emphasized  as  requisite  in  measuring  psychical  (as  distinct  from 
anatomical,  etc.)  correlation. 

*  Thorndike,  Lay  and  Dean:  "The  Eelation  of  Accuracy  in  Sensory 
Discrimination  to  General  Intelhgence,"  Am.  J.  P.  Vol.  xx.  July  1909, 
pp.  364-369. 

t  C.  Spearman,  Am.  J.  P.  Vol.  xv.  op.  cit.  p.  284. 
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Thorndike  sums  up  as  follows:  "In  general  there  is  evidence 
of  a  complex  set  of  bonds  between  the  psychological  equivalents 
of  both  what  we  call  the  formal  side  of  thought  and  what  we 
call  its  content,  so  that  one  is  almost  tempted  to  replace  Spear- 
man's statement  by  the  equally  extravagant  one  that  there  is 
nothing  whatever  common  to  all  mental  functions,  or  to  any  half 
of  them*." 

Quite  recently  (Dec.  1909)  there  has  appeared  a  detailed 
account  by  Mr  Cyril  Burtf  of  some  experiments  on  children, 
"  commenced  with  a  view  to  testing  in  practice  the  mathemati- 
cal methods  of  Dr  Spearman,  and  to  verifying  the  experimental 
results  both  of  Dr  Spearman  and  of  Prof.  Meumann  "  {op.  ext. 
p.  95). 

Two  groups  of  subjects  were  experimented  on  :  (1)  a  group 
of  30  boys  (age  limits,  12-6— 13  6)  from  an  elementary  school, 
and  (2)  a  gi-oup  of  13  boys  (same  age  limits)  from  a  prepara- 
tory school.  The  mental  tests  employed  were  discrimination  of 
(1)  weights,  (2)  pitch,  and  (3)  lengths  of  lines  (W.  H.  R.  Rivers' 
adjustable  apparatus  used),  measurement  of  (4)  "spatial 
threshold,"  (5)  rate  of  tapping  (modification  of  Binet's  "  petits 
points"),  (6)  card-dealing,  (7)  card-sorting,  (8)  alphabet-finding, 
(9)  immediate  memory,  (10)  "  mirror  test,"  (11)  "  spot  pattern  " 
test  (tachistoscopic :  McDougall's  apparatus),  (12)  "dotting" 
test  (apparatus  used :  "  dotting  apparatus "  of  McDougall, 
devised  for  testing  and  recording  continued  maximal  voluntary 
concentration  of  attention).  A  measure  of  "  imputed  intelli- 
gence" was  also  obtained,  from  judgments  of  teachers  and 
school-fellows. 

Tests  (1),  (2),  (3)  and  (4)  were  classed  as  sensory  tests. 

Tests  (5)  and  (6)  were  classed  as  motor  tests. 

Tests  (7)  and  (8)  were  classed  as  sensori-motor  tests. 

Tests  (9),  (10)  and  (11)  were  classed  as  association  tests. 

Test  (12)  was  classed  as  a  test  of  voluntary  attention. 

The  tests  were  applied  to  the  children  individually,  and 
repeated  at  least  once.  The  reliability  coefficients  were  in  most 

•  Thorndike,  Lay  and  Dean  :  op.  cit.  p.  368. 

t  Cyril  Burt:  "Experimental  Tests  of  General  Intelligence,"  British 
Journal  of  Psychology,  Vol.  m.  Parts  i.  and  ii.  Dec.  1909,  pp.  94-177. 
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cases  high,  only  those  of  tests  1  (preparatory  school),  3,  5,  7  and 
8  (preparatory  school),  10,  and  11  being  under  -6,  and  of  these 
only  7  and  8  (preparatory  school)  were  lower  than  -5 ;  so  that 
the  average  ability  of  each  individual  tested  was  determined 
with  fairly  close  approximation. 

The  sensory  tests  were  found  to  correlate  but  slightly  with 
one  another  (see  p.  177,  foot  of  page),  and  the  highest  correlation 
with  imputed  intelligence  was  that  of  pitch-discrimination  in 
the  elementary  school,  which  =  0-40  ±  010.  The  remaining  tests 
correlated  much  more  closely  with  one  another  and  with  imputed 
intelligence.  The  chief  general  result  of  importance  which  the 
author  considers  himself  to  have  demonstrated  conclusively  is 
that  the  coefficients  form  a  "hierarchy"  (see  p.  87  above)  in 
both  groups  of  subjects  tested.  Considering  this  result  due  to 
the  existence  of  a  "  central  factor,"  he  suggests  that  the  central 
factor  indicated  is  "  voluntary  attention,"  since  the  dotting  test 
(no.  12)  heads  the  hierarchy  in  both  cases,  and  also  "the  intelli- 
gence coefficients  for  successive  series  with  the  same  test " 
diminish  from  the  first  onwards  (except  in  the  case  of  tests  5 
and  11). 

This  conclusion  contradicts  Spearman's  view*,  and  coincides 
with  the  view  of  Binetf. 

Psychologically  the  research  is  excellent.  The  tests  are  in 
many  instances  original,  and  the  possible  psychological  factors 
involved  in  each  are  debated  with  exhaustive  completeness. 
With  regard  to  the  "  statistical "  side  of  the  work,  I  should  like 
to  make  a  few  very  brief  remarks : 

(1)  The  number  of  individuals  tested  in  the  preparatory 
school  (13)  is  far  too  small.  The  size  of  probable  errors  is  not 
known  for  such  small  numbers,  though  it  must  be  large.  Again, 
the  two  groups  tested  are  not  comparable  to  one  another.  It 
is  a  pity  that  more  groups  of  each  kind  of  school  were  not 
examined,  in  order  to  obtain  results  that  would  be  comparable 
and  would  serve  as  "  controls  "  of  one  another. 

(2)  It  is  a  pity  that  Spearman's  formula,  ?•  =  sin  , 
*  See  above,  p.  89. 

t  L'Annee  Psijchologique,  vi.  p.  395.    See  also  above,  p.  89. 
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should  have  been  employed  (see  above,  pp.  71-76).  In  par- 
ticular, the  proof  of  the  probable  error  of  this  value  given  at  the 
foot  of  p.  109  is  incorrect.    The  true  value  of  the^P.E.  of  r 

calculated  from  R  (for  zero  value  of  the  coefft.)  is  ,  not 

(see  above,  p.  73). 

(3)  It  is  very  doubtful  whether  the  results  do  jjrove  the 
existence  of  a  "hierarchical"  order.  The  orders  for  the  elementary 
school  and  for  the  preparatory  school  do  not  correspond  at  all 
closely,  the  rank-correlation,  p,  working  out  to  0-56  only.  ^  The 
plan  of  comparing  average  deviations  and  average  P.E.'s  on 
p.  163  is  open  to  serious  objection  (apart  from  the  fact  mentioned 
in  footnote  2  of  the  same  page). 

(4)  The  standard  method  for  investigating  the  inter-rela- 
tions of  the  several  coefficients  would  of  course  be  that  of 
multiple  or  "partial"  correlation*;  but  this  method  was  not 
employed. 

(5)  The  probable  errors  of  the  quantities  in  Table  VIII, 
p.  172,  should  have  been  given,  and  taken  into  consideration  in 
drawing  inferences  from  them. 

These  are  the  few  difficulties  that  occur  to  one  on  a  rapid 
reading  of  the  paper,  though  they  detract  but  little  from  its 
general  usefulness. 

The  results  of  a  very  detailed  investigation  into  the  relation- 
ship of  intelligence  to  the  size  and  shape  of  the  head,  and 
to  other  physical  and  mental  characters,  by  Prof.  Karl  Pearson f, 
appear  in  Biometrika,  Vol.  v.  1906 — 1907.  The  subjects 
measured  were  1000  Cambridge  gi-aduates  and  considerably 
more  than  5000  school  children.  Special  care  was  taken  in 
drawing  up  a  quantitative  scale  of  intelligence,  adjustments 
being  made  so  that  the  results  fitted  a  "  normal "  or  Gaussian 
distribution.  The  correlations  were  worked  out  in  several 
different  ways, — correlation  coefficient  (r),  correlation  ratio  (rj), 
coefficient  of  mean  square  contingency,  and  the  method,  first 
*  See  above  pp.  51,  52  and  63,  64  ;  also  Appendix  iii. 

t  Karl  Pearson  :  "On  the  Relationship  of  Intelligence  to  Size  and  Shape  of 
Head,  and  to  other  Physical  and  Mental  Characters,"  Biometrika,  Vol.  v. 
1906-1907,  pp.  105-146. 
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suggested  in  this  paper,  of  the  analograph.  Pearson  describes 
this  last  method  as  follows  :  "  In  the  case  of  intelligence,  I  take 
a  normal  scale  as  ray  base  line  and  plot  up  the  percentage  of  the 
character  for  each  grade  of  intelligence  along  the  centroid 
vertical  of  the  corresponding  range,  drawing  a  horizontal  line 
to  represent  the  mean  percentage  in  the  population  at  large. 
We  thus  obtain  a  diagram,  which  I  will  venture  to  call  an 
analograph. 

"  If  the  percentage  increases  or  decreases  continually  with 
intelligence  (or  with  the  base  character,  whatever  it  may  be), 
I  term  the  relationship  homoclinal ;  if  the  percentage  does  not 
reach  its  maximum  with  the  maximum  or  minimum  of  intelli- 
gence, I  term  the  diagram  heteroclinal  *." 

The  mental  characters  investigated  (which  alone  interest  us 
here)  were — temper,  popularity,  self-consciousness,  shyness, 
conscientiousness,  quiet  habits,  and  handwriting,  and  in  the  case 
of  the  school  children  the  correlations  of  intelligence  with  these, 
and  also  with  age,  were  found  to  be  as  follows : 


Character 

Boys 

Girls 

" 

Conscientiousness 

•46 

•43 

Handwriting 

•28 

•30 

Popularity 

•22 

•30 

Temper  ... 

•19 

•22 

Shyness  ... 

•03 

•18 

Self-Consciousness 

•10 

•03 

Age 

•05 

•08 

Quiet  Habits 

•04 

•09 

The  low  value  of  the  correlation  of  intelligence  with  age 
should  be  noted. 

Space  does  not  allow  of  further  consideration  of  Karl  Pear- 
son's paper.  It  is  undoubtedly  the  most  important  contribution 
that  has  hitherto  been  made  to  the  subject  of  the  relation  of 
intelligence  to  other  mental  and  physical  characters  in  the 
human  organism. 

Other  articles  dealing  with  psychological  classifications  and 
measurements,  which  have  recently  appeared  in  Biometrika,  are 

•  Op.  cit.  p.  129. 
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two  accounts*  of  the  results  of  applying  the  latest  statistical 
methods  to  the  working  up  of  material  collected  by  certain 
psychologists.  In  the  case  of  the  first  of  these  two,  viz.,  the 
research  of  Messrs  Heymans  and  Wiersma  into  the  inheritance 
of  physical  characters,  the  material  had  been  obtained  by  the 
questionnaire  method,  90  questions  in  all  being  sent  out,  divided 
into  six  classes  according  to  the  nature  of  the  characters  to 
which  they  referred.    Examples  of  the  questions  are : 

Class  I.  Movements  and  occupations  (Q.  1—8),  mobility  or 
restfulness ;  steady  or  temporary  industry  or  laziness ;  etc. 

Class  V.  Inclinations  (Q.  44—81),  use  of  alcohol ;  vanity ; 
ambition ;  love  of  truth ;  etc. 

More  than  400  sets  of  answers  were  returned. 

In  cases  where  the  questions  involved  only  two  alternative 
answers,  the  four-fold  table  method  was  used  in  evaluating  the 
correlation  between  parents  and  children ;  where  four  or  five 
alternative  answers  were  demanded,  the  contingency  method 
was  used,  and  the  result  tested  by  a  four-fold  division  of  the 
table.  The  results  were  corrected  for  assortative  mating  by 
means  of  the  correlation  coefficient  between  father  and  mother. 
(In  the  tables  for  coefficient  of  assortative  mating,  the  parents 
were  weighted  with  the  number  of  their  children.)  The  co- 
efficients of  assortative  mating  were  found  to  be  more  frequently 
negative  than  positive.  The  account  in  Biometrika  concludes 
as  follows  :  "  After  allowing  for  the  effect  of  assortative  mating, 
the  mean  coefficients  of  resemblance  between  fathers  and  sons, 
and  between  mothers  and  daughters,  calculated  for  one  set  of 
characters  by  the  contingency  method  and  for  another  set  by 
the  four-fold  correlation  method,  come  in  each  case  to  very 

*  (a)  "  The  Inheritance  of  Psychical  Characters,"  being  further  Statistical 
Treatment  of  Material  Collected  and  Analysed  by  Messrs  G.  Heymans  and 
E.  Wiersma:  by  Edgar  Schuster  and  E.  M.  Elderton,  Biometrika,  Vol.  v.  1906- 
1907,  pp.  460-469. 

(h)  "  On  the  Association  of  Drawing  with  other  Capacities  in  School 
Children,"  by  E.  M.  Elderton ;  being  a  biometric  evaluation  of  the  results  of 
E.  Ivanhoff :  "Recherches  expdrimeutales  sur  le  Dessin  des  Ecoliers  dc  la 
Suisse  Remande,"  Archives  dc  Psychologic,  in.  viii.  1908.  Biometrika,  Vol.  vn. 
pp.  222-226. 
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nearly  ^,  the  value  proposed  originally  for  the  parental  inheri- 
tance coefficient  by  Sir  F.  Galton  "  (p.  469). 

In  the  second  case,  that  of  Ivanhoff 's  data  in  his  investiga- 
tions into  the  relation  of  draiuing  to  other  mental  abilities  in 
school  children,  Miss  Elderton  evaluated  the  results  by  the 
contingency  method  (nine-fold  grouping,  divisions  being  "fort," 
"  moyen,"  and  "  faible ")  and,  in  some  cases,  by  the  four-fold 
table  method  also.  The  conclusions  she  was  able  to  draw  were 
briefly  as  follows : 

(1)  Drawing  correlates  with  other  abilities  more  closely  in 
girls  than  in  boys. 

(2)  The  correlations  in  all  cases  are  extraordinarily  low. 
Miss  Elderton  suggests  that  "  this  may  arise  possibly  from 
capacity  in  drawing  being  a  hereditary  character,  having  small 
association  with  other  school  measures  of  fitness." 

(3)  General  ability  shows  the  highest  of  the  correlations,  in 
both  sexes. 

(4)  The  so-called  "  psycho-pedagogic  characters  " — atten- 
tion, obedience,  industry,  cleanliness,  and  temper — correlate  less 
closely  than  proficiency  in  the  ordinary  subjects  of  the  school 
curriculum  do.  In  the  case  of  the  girls,  they  are  all  the  lowest 
on  the  list. 

Accounts  of  other  investigations  into  the  relationship 
existing  between  the  various  subjects  of  the  school  and  college 
curriculum  are  to  be  found  in  papers  by  W.  P.  Burris, 
A.  G.  Smith,  Brinckerhoff,  Morris  and  Thorndike*.  The  results 
are  somewhat  conflicting,  and  in  every  case  surprisingly  low. 
Thorndike  and  Foxf  found  low  correlations  between  part- 
capacities  of  the  same  subject  (arithmetic) ;  and  Burris 
obtained  a  coefficient  for  algebra  and  geometry  which  was 
nearly  as  low  as  that  between  mathematics  and  a  non-mathe- 
matical subject.  I  have  myself  found |  that,  after  correction  for 
effects  of  irrelevant  conditions,  algebra  and  geometry  show  hardly 

*  All  in  Vol.  XI.  of  Columbia  Contributions  to  Education. 
+  Op.  cit.  pp.  138-143. 

+  William  Brown:    "An  Objective  Study  of  Mathematical  Intelligence," 
Bionietrika,  Vol.  vn.  Part  iii.  1910. 
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any  correlation  at  all-in  fact,  the  partial  coefficient  for 
"arithmetical  ability  constant"  is  zero. 

An  experimental  investigation  of  the  inter-relations  of  some 
of  the  higher  mental  processes  in  50—90  normal  school  students 
was  made  by  A.  Peterson*  in  1908.  The  marks  used  were 
good,  medium  and  poor,  and  the  following  are  some  representa- 
tive coefficients  selected  from  those  obtained  : 
Reasoning  and  generalizing  -95,  Generalizing  and  memory  "40 
„   memory  ...   "40,  „  „     accuracy  '28 

accuracy...  '45,  Memory  and  accuracy  ...  -31 

As  one  further  illustration  of  the  use  to  which  the  theory  of 
correlation  has  been  put  in  Educational  Psychology,  mention 
may  be  made  of  an  investigation  by  Thorndikef  into  the  cor- 
relation between  entrance  marks  and  later  performances  at 
college.    For  total  marks  the  coefficients  were  : 

Freshman  year,  -62 ;  Sophomore  year,  '50 ;  Junior  year,  '47  ; 
Senior  year,  25. 

Taking  the  various  subjects  separately,  Thorndike  found  a 
slight  superiority  over  the  rest  in  the  case  of  science,  and  a 
slight  inferiority  in  the  case  of  mathematics.  The  number  of 
cases  worked  on  was  56 — 84. 

In  a  recent  short  paper  J,  Thorndike  brings  forward  correla- 
tion results  which  he  considers  to  be  "  very  strong  evidence  of 
the  dependence  of  efficiency  of  memory  upon  content  and  of 
the  specialisation  of  mental  functions  in  general."  On  the 
other  hand,  he  finds  that  "  the  relation  between  retention  of 
the  effects  of  an  experience  for  one  or  two  minutes  and  their 
retention  for  one  or  two  days  seems  to  be  one  of  the  closest  yet 
measured  in  human  nature."  The  "raw"  correlation  was  "5^  +  "1, 
(n  =  40),  which  when  corrected  for  mixture  of  sexes  and 
inaccuracy  of  measurement  gave  a  final  value  of  "8  +  '1. 

*  A.  Peterson:  "Correlation  of  Certain  Mental  Traits  iu  Normal  School 
Students,"  Psychological  Review,  Vol.  xv.  1908,  pp.  323-338. 

t  E.  L.  Thorndike:  "An  Empirical  Study  of  College  Entrance  Examinations," 
Science,  N.S.  Vol.  xxiii.  .June  1906,  pp.  839-845. 

X  E.  L.  Thorndike:  "The  Relation  between  Memory  for  Words  and  Memory 
for  Numbers,  and  the  Relation  between  Memory  over  Short  and  Memory  over 
Long  Intervals,"  American  Journal  of  Psychology,  Vol,  xxi.  1910,  pp,  487,  488. 
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CHAPTER  III 


SOME  EXPEEIMENTAL  RESULTS* 

The  research  to  be  described  in  this  chapter  was  devised  for  the 
purpose  of  determining  to  what  extent  correlation  exists  between 
certain  very  simple  mental  abilities  in  cases  where  the  individuals 
experimented  upon  are,  as  near  as  may  be,  identically  situated 
with  respect  to  previous  practice,  general  training,  and  environ- 
ment ;  and  how  closely,  if  at  all,  these  elementary  abilities  are 
related  to  general  intellectual  ability  as  measured  by  teachers' 
judgments,  school  marks,  etc.  Every  effort  was  made  to  keep 
the  groups  of  individuals  tested  as  homogeneous  as  possible ; 
and  instead  of  measuring  irrelevant  factors  and  "correcting" 
for  them  in  the  later  stages  of  the  research,  the  influence  of 
such  irrelevant  factors  was  excluded  right  from  the  beginning 
by  a  rigorous  segregation  of  the  material,  and  in  other  ways. 

The  groups  of  individuals  to  which  the  tests  were  applied, 
were  as  follows : 

Group  I,  66  boys  of  a  London  elementary  school,  all  between 
the  ages  11  and  12. 

Group  II,  39  girls  of  a  London  elementary  school,  all  between 
the  ages  11  and  12. 

Group  III,  40  boys  of  a  London  higher  grade  school,  all 
between  the  ages  11  and  12. 

Group  IV,  56  training  college  students  (women),  of  the  same 
year  and  of  approximately  the  same  age. 

Group  Va,  35  university  students  (men). 

[Group  V6,  23  university  students  (women).] 

*  Published  in  the  British  Journal  of  Psychology,  Vol.  in.  Pt.  3,  Oct.  1910. 
A  few  unimportant  alterations  have  been  made,  and  a  summary  of  the  results 
has  been  added. 
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Little  need  be  said  as  to  the  nature  of  the  groups.  Group 
III  was  as  homogeneous  as  could  possibly  be  expected  or  desired. 
The  individuals  were  not  only  of  the  same  age,  but  also  belonged 
to  the  same  form  and  had  all  worked  for  months  past  under 
exactly  the  same  environment  (same  teacher,  etc.).  They  were, 
however,  a  rigorously  selected  class,  as  might  be  expected  from 
the  character  of  the  school. 

Group  IV  was  also  thoroughly  homogeneous.  During  an 
entire  year  previous  to  the  application  of  the  tests  they  had 
lived  under  exactly  the  same  environment. 

In  Group  II  there  was  a  slight  mixing  of  "standards"  vvhich 
introduced  a  slight  degree  of  heterogeneity,  but  the  effect  of  this 
on  the  results  must  have  been  very  small. 

Group  I  was  also  slightly  heterogeneous  owing  to  mixture  of 
standards,  and  the  results  show  that  the  effect  of  this  was  some- 
what greater  than  in  the  preceding  case. 

Group  Va  was  fairly  homogeneous,  but  was  of  course  a 
"  selected  "  group.  The  same  remarks  apply  to  Group  Yh,  but 
owing  to  the  smallness  of  the  numbers  (23)  tested,  the  results 
were  worked  out  by  the  method  of  ranks  (p),  which  was  con- 
sidered good  enough  under  such  circumstances,  and  they  are 
recorded  avowedly  as  mere  approximations. 

Other  groups  of  school  children  were  also  tested,  but  as  the 
marking  of  the  results  is  not  yet  complete,  no  further  reference 
will  be  made  to  them  here,  except  in  the  case  of  the  "  addition 
tests,"  for  which  see  Appendix  II,  note  at  top  of  first  page. 

As  regards  the  tests  employed,  they  were  chosen  not  so  much 
for  their  novelty  (though  a  few  of  them  are  new  and  the 
methods  of  applying  the  tests  were  determined  in  every  case 
entirely  by  the  requirements  of  the  circumstances),  nor  so  much 
for  their  a  priori  likelihood  of  showing  inter-correlation,  as  for 
their  convenience  in  admitting  of  application  to  an  entire  group 
of  subjects  simultaneously  and  unobtrusively.  The  following  is 
a  list  of  them : 

1.  Crossing  through  letters  e  and  r  in  a  page  of  print. 

2.  Crossing  through  letters  a,  n,  o,  and  5  in  a  page  of 
print. 
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3.  Crossing  through  every  letter  in  a  page  of  print. 

4.  Adding  up  single  digits  in  groups  of  ten.  Measurement 
of  (tt)  speed,  (6)  accuracy. 

5.  Bisecting  ten  printed  lines  (80  mm.  long),  and  putting 
m  one  of  the  points  of  trisection  in  each  of  ten  other  lines 
(90  mm.  long). 

6.  Muller-Lyer  Illusion.  Measurement  of  (a)  size,  (b)  mean 
variation. 

7.  Vertical-Horizontal  Illusion.  Measurement  of  (a)  size 
(b)  mean  variation. 

8.  Mechanical  Memory  (permanent),  tested  by  means  of 
nonsense  syllables. 

9.  Memory  for  poetry. 

10.  Combination  test  (Ebbinghaus). 

In  the  case  of  Groups  II  and  III,  recourse  was  also  had  to : 

11.  Marks  for  Drawing. 

12.  Total  School  Marks. 

18.  Grading  for  General  Intelligence  (two  independent 
measures). 

Finally,  with  Groups  Va  and  V6,  the  following  test  was  also 
employed  : 

14.  Association-time  (uncontrolled).  Measurement  of  rate 
of  sequence  of  ideas  called  up  by  a  stimulus-word. 

The  performances  of  the  several  groups  in  these  tests  admit 
of  comparison  in  terms  of  the  mean,  standard  deviation,  and 
coefficient  of  variation,  provided  that  the  probable  errors  of  these 
constants  are  also  evaluated. 

With  the  exception  of  test  (9),  and,  in  some  cases,  of  test 
(8),  every  test  was  applied  twice,  the  second  test  being  given 
about  a  fortnight  after  the  first,  and  at  the  same  hour  of  the 
day.  In  the  case  of  the  school-children,  I  myself  applied  both 
tests  in  the  presence  of  the  form-master  or  mistress.  The 
adults  whose  measurements  are  recorded  and  employed  in  the 
present  research  were  also  tested,  with  hardly  an  exception,  by 
myself.  It  should  be  added  that  the  research  commenced  with 
a  very  much  larger  number  of  adults  (university  students  and 
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others),  amounting  to  over  100,  but  the  much  smaller  numbers 
recorded  as  Groups  Ya  and  Yb  were  alone  used  for  the  evalua- 
tion of  coefficients,  since  they  alone  displayed  sufficient  reha- 
bility  and  homogeneity  for  the  purpose. 

(1)    ER  Test. 

Pages  of  French  words,  arranged  in  irregular  order  so  that 
they  did  not  "  make  sense,"  and  so  chosen  that  the  number  of 
e's  and  r's  was  approximately  constant  from  line  to  line,  were 
employed.  The  page  was  given  out  face  downwards,  and  at  a 
given  signal  the  subject  turned  it  over  and  proceeded  to  cross 
through  every  e  and  every  r  that  he  came  to,  beginning  with 
the  first  line  and  moving  down  line  by  line,  until  he  received 
the  signal  to  stop.  The  time  allowed  for  the  test  was  5  minutes 
in  the  case  of  the  children  and  3  minutes  in  the  case  of  the 
adults.  The  subject  was  urged  to  avoid  passing  over  any  of 
the  stated  letters  but  otherwise  to  work  as  quickly  as  possible. 
Before  the  commencement  of  the  test,  a  full  explanation  of  it 
was  given  to  the  group,  illustrated  by  examples  on  the  black- 
board. This  was  done  in  the  case  of  every  test.  A  different 
set  of  words  was  employed  in  the  second  test. 

System  of  marking  : 

1  mark  for  every  letter  crossed  through  correctly  ; 
—1  mark  for  every  letter  passed  over  or  crossed  through 
incorrectly. 

Results  of  ER  Test 


Group 

Mean 

(T 

Coefficient 
of 

Variation 

Eeliability 
coefficient 

('•,)  for 
each  test 

Eel.  coeif.  (r„)  for 
amalgamated  pair 
2?-,  t 

of  tests,  ?-o  =  :; — — 

I 

377  +  6 

68  +  4 

18  +  1-1 

-60 

•75 

II 

362  +  8 

71  +  5 

19-5  +  1-5 

-65 

•79 

III 

417  +  6 

57  +  4 

13-7  +  1-] 

•75 

•86 

*V(a) 

204  +  4-5 

41  +  3-3 

20+1-7 

-97 

- 

*  One  test  only. 

t  r2  measures  the  extent  to  which  the  amalgamated  results  of  the  two  testa 
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(2)    AN08  Test. 

The  method  of  procedure  was  identical  with  that  described 
above  for  test  (1),  except  that  the  letters  to  be  crossed  through 
were  of  four  kinds  instead  of  two,  and  that  the  time  allowed 
was  5  minutes  for  children  and  also  for  adults. 


Results  of  ANOS  Test 


Group 

Mean 

Coefficient 
of 

Variation 

Reliability 
coefficient 
'•i 

Reliability 
coefficient  ?-o 

I 

161  +  5 

58  +  3-5 

36  +  2-4 

•77 

•87 

II 

191  +  5-6 

51  +  3-8 

26-5  +  2-2 

•84 

•91 

III 

228  +  6 

56  +  4-1 

24-4  +  2-0 

•81 

-89 

(3)    Motor  Test. 

In  this  test  the  subjects  were  asked  to  cross  through  every 
letter  in  a  page  of  printed  French  words.  Time  allowed  in  all 
cases — three  minutes.  Method  of  procedure  otherwise  identical 
with  that  for  (1)  and  (2). 


would  correlate  with  a  similar  amalgamated  series  of  two  otlier  applications  of 
the  same  test.  If  sc, ,  ,  .t/,  a-^'  be  two  pairs  of  results  (x  denoting,  as  usual, 
deviation  from  the  mean  value),  we  may  assume  that 

(Txi  -  o:v.i  =  <r.vi'  =  crxi'  =  o'.r  (say) , 
and  that  S  (x^x^')  =  S  {x^  x^)  =  S  {x^  a;/)  =  S  [x^  xz')  =  na^- 1\ . 

_  ^  S(xi  +  a;2)(-<  +  -V) 

Hence  we  get  ''■2="^ri — ; — 'iTT,  V 

4/1  r\ 

It. 

=  -  i-.  Q.E.D. 

It  is  easily  seen  that  the  amalgamation  of  4  tests  gives  a  reliability 

coefficient  =,-^'^i-  ;  and,  in  general,  for  n  tests  we  have 
1  +  dl\ 

''"^l  +  (n-l)"ri' 

This  last  formula  furnishes  a  ready  means  of  determining,  from  the  reliability 
coefficient  of  a  single  test,  the  number  of  applications  of  the  test  which  would  be 
necessary  to  give  an  amalgamated  result  of  any  desired  degree  of  reliability. 
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Results  of  Motor  Test. 


Group 

Mean 

<r 

Coefficient 
of 

Variation 

Reliability 
coefficient 

xveiiauiiity 
coefficient 

I 

718  +  13 

148  +  9 

20-6+1-3 

•91 

-95 

II 

720+16 

148  +  12 

20-5  +  1-7 

-85 

-92 

III 

813+11 

103  +  7-7 

12-7  +  1-0 

-76 

-86 

(4)    Addition  Test 

Duplicates  of  pages  from  one  of  Kraepelin's  Rechenhefte 
were  used,  adapted  to  the  purpose  by  the  printing  of  short 
horizontal  lines  below  each  tenth  figure,  and  by  the  omission  of 
all  figures  below  the  thirtieth  in  each  column.  Time  allowed 
for  each  test — five  minutes.  The  speed  of  addition  [4  (a)]  was 
measured  by  the  number  of  sums  (groups  of  ten  digits)  worked 
in  the  given  time,  the  accuracy  of  addition  [4  (b)]  by  the  per- 
centage of  correct  answers.  In  the  case  of  the  children,  4  (a)  was 
measured  by  the  number  of  digits  added,  marks  being  allowed 
for  the  part  of  a  sum  with  which  they  usually  ended. 


Results  of  Addition  Test. 


Group 

Mean 

Speed 

Accuracy 

Speed 

Accuracy 

I 

II 

III 
*IV 
*V(a) 

V(i) 

235  +  6 
210±6 
287  ±  9 
55  +  1-5 
61  +  21 

163  +  3 
149  +  4 
171  +  1-4 
173  +  1-5 
184  +  1-2 

67±4 
52  +  4 
79  +  6 
16+1-0 
19  +  1-5 

31-3  +  1-8 
33-4  +  2-5 
13-3  +  1 
16-8  +  1-1 
11-3  + -9 

•  One  sura  of  10  digits  taken  as  unit ;  in  other  cases,  the  number  of  digits 
added  was  taken  as  the  measure. 
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Results  of  Addition  Test  (cont). 


Lrronp 

Coefficient  of  Variation 

Eeliability 
coefficient 

Eeliability 
coefficient 

Speed 

Accuracy 

Speed 

Acc. 

Speed 

Acc. 

I 

28  +  1-8 

19  +  1-2 

-82 

•33 

•90 

•50 

II 

24-6 +  2-0 

22-4  +  1-8 

•69 

•46 

•82 

•63 

III 

27  +  2-2 

8 +  -6 

-68 

[0] 

•81 

*IV 

30  +  2-9 

9-6 +  -86 

•93 

•29 

•96 

•45 

*V(a) 

32  +  2-8 

6  ±-5 

-95 

•22 

•97 

•36 

Y{b) 

•98 

•59 

■99 

•74 

(5)    Bisection  and  Trisection  of  Lines. 

Every  test  paper  contained  ten  printed  lines,  each  8  cm. 
long,  for  bisection ;  and  ten  printed  lines,  each  9  cm.  long,  for 
trisection.  The  lines  were  printed  three  in  a  row,  and  those 
situated  immediately  under  others  were  shifted  a  little  to  one 
side.  It  was  very  certain,  however,  that  the  bisection  or 
trisection  of  any  one  of  the  lines  was  influenced  by  the  positions 
of  the  neighbouring  lines  and  of  the  edges  of  the  paper.  This 
fact  diminishes  the  value  of  the  test,  but  does  not  of  itself 
deprive  the  test  of  all  use  as  a  measure  of  one  form  of  sensory 
discrimination.  A  more  serious  drawback  was  found  to  be  the 
very  great  individual  variability  displayed,  which  made  the 
reliability  coefficients  very  low.  In  order  to  get  a  more 
reliable  measure  for  Group  V  the  results  for  bisection  and 
trisection  in  both  tests  (i.e.  of  the  division  of  40  lines  in  all  by 
each  individual)  were  thrown  together  and  the  total  taken  as  a 
measure  of  sensory  discrimination.  Onli/  one  point  of  trisection 
was  asked  for,  this  being  put  in  alternately  towards  the  left  and 
the  right  ends  of  the  successive  lines.  Trisection  was  done 
very  unsatisfactorily  by  the  school-children,  but,  in  the  case  of 
adults,  gave  a  higher  measure  of  reliability  than  did  bisection. 

*  One  sum  of  10  digits  taken  as  unit ;  in  other  cases,  the  number  of  digits 
added  was  taken  as  the  measure. 
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The  ave^-age  crude  error  was  taken  as  the  measure  of  in- 
accuracy, since  it  was  found  to  give  more  concordant  results 
(higher  reliability  coefficients)  than  the  other  possible  way  of 
measuring  the  inaccuracy,  viz.  the  mean  variable  error. 


Results  of  Bisection  and  Trisection. 


Group 

Mean 

a 

Coeff.  of 
Variation 

Eel. 
coeff.  ?'i 

Eel. 
coeff.  ?'2 

Bisection }  j 
only  \ 

Bisection  &  ^ 
■   Trisection  "  y^^^^ 

1 

311  ±11 

480  ±14 

129  ±8 
129  ±10 

41  ±2-8 
26  ±2-2 

•35 

( B.  -36 

It.  -35 

(B.  -44 
"(T.  -87 

•52 

B.  -53 
T.  -52 
B.  -93 
T.  -82 

(6)    Milller-Lyer  Illusion. 

The  adjustable  apparatus,  designed  by  Dr  W.  H.  R.  Rivers, 
was  used  to  measure  the  size  of  this  illusion.  The  length  of 
the  standard  line  was  75  mm.,  and  results  recorded  below  are 
also  in  mm.  Each  child  of  Groups  II  and  III  was  tested 
individually  by  myself,  being  asked  to  make  ten  adjustments  of 
the  apparatus,  alternately  lengthening  and  shortening  the 
variable  line,  and  having  the  standard  line  alternately  to  the 
right  and  to  the  left.  To  obtain  the  reliability  coefficient,  the 
results  were  divided  into  two  halves  and  correlated.  The 
average  deviation  was  taken  as  a  size  of  the  illusion,  and  the 
mean  variation  (m.v.)  was  also  determined.  Ten  subjects  of 
Group  IV  were  tested,  10  times  each,  by  one  of  the  mistresses 
of  the  college.  Groups  V  (a)  and  V  (6)  were  tested  by  myself, 
but  four  times  only. 

The  present  test  was  the  only  one  employed  which  involved 
the  use  of  apparatus  or  the  testing  of  the  subjects  separately. 
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Results  of  the  Milller-Lyer  Illusion  Test. 


Group 

Mean 

cr 

Coeff.  of  Variation 

Eel.  coeff.  for 
size  of  illusion 
only 

Size 

M.  V. 

Size 

M.V. 

Size 

M.  V. 

'2  • 

II 

III 

IV 
*V(a) 
*V(6) 

17-3 +  -31 
16-8  +  4 
13-7  +  1-11 
16 +  -67 

3-3 +  -14 
3-1 +  -12 
2-3+ -18 

2-  8+ -22 

3-  8 +-3 
5-3 +  -8 
3-7 +  -48 

1-3  +  -10 
1-1 +  -08 
•8 +  -13 

16-2 +  1-3 
22-5 +  1-8 
38  +  6-5 
23  +  3-1 

39  +  3-4 
36  +  3-1 
36  +  6-1 

-65 
•86 
•76 
•57 
•68 

•79 
•92 
•86 

(7)    Vertical-Horizontal  Illusion. 

The  material  for  this  test  consisted  of  a  set  of  10  large 
L-shaped  figures  clearly  printed  on  a  very  large  sheet  of  paper, 
■which  could  be  folded  in  two.  Each  L  had  unequal  arms,  the 
shorter  being  10  cms.  in  length,  the  longer  14  cms.,  and  the 
vertical  was  alternately  the  shorter  and  the  longer  of  the  two. 
These  papers  having  been  distributed,  it  was  explained  by 
means  of  the  black-board  that  the  task  to  be  performed  was  to 
mark  off  a  part  along  the  longer  arm  (estimated  from  the 
angle)  such  that  it  seemed  equal  to  the  shorter  arm,  the 
subject  limiting  his  attention  strictly  to  each  figure  in  turn, 
and  estimating  hy  eye  only.  When  all  the  ten  figures  had  been 
marked  in  this  way  the  subjects  were  asked  to  go  over  them 
once  more,  altering  those  which  seemed  too  long  or  too  short, 
the  object  of  this  being  to  make  sure  of  the  full  effect  of  the 
illusion  (of  the  existence  of  which,  by  the  way,  not  one  of  the 
subjects  tested — Groups  I,  II,  III  and  IV — was  aware).  A 
fortnight  later  the  test  was  repeated  with  other  papers.  The 
average  size  and  the  M.V.  of  the  illusion  were  evaluated  as  in 
(6)  above.  Two  objections  may  be  made  to  this  method  of 
applying  the  test:  (1)  the  presence  of  surrounding  L's  in- 
fluenced the  judgment;  this  was  partly  obviated  by  the  way 
the  figures  were  arranged  on  the  page,  and  I  believe  the 

*  One  test  only. 
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influence  was  actually  very  small,  each  L  being  large  enough  to 
exclusively  rivet  the  attention  of  the  subject  upon  itself  in  its 
turn ;  (2)  the  eyesight  of  the  subjects  of  the  experiment  was 
not  previously  tested.  This  objection  is  much  more  serious. 
Even  if  the  illusion  is  not  to  be  entirely  explained  as  the  effect 
of  astigmatism,  the  latter  must  play  an  important  part  in 
determining  the  result.  All  we  can  say,  then,  is  that  the  test 
measures  the  balance  of  effect  of  the  various  factors  contributing 
towards  the  falsifying  of  judgments  comparing  horizontal  and 
vertical  distances.  A  somewhat  remarkable  result,  which  I  do 
not  remember  to  have  heard  or  seen  reported  before,  is  that 
with  as  many  as  20  measurements  of  each  subject,  quite  a 
large  proportion  of  the  subjects  show  a  negative  illusion,  i.e.  they 
■jMifZerestimate  the  vertical  instead  of  overestimating  it.  One 
might  retort  that  this  is  simply  a  case  of  ovei^- correction,  were  it 
not  for  the  still  more  remarkable  fact  that  in  the  case  of  all  the 
children  measured,  the  proportion  is  exactly  ^,  in  Group  I,  22 
out  of  66,  in  Group  II,  13  out  of  39,  in  Group  III,  13  out  of  40. 
In  Group  IV  the  second  test  has  unfortunately  not  yet  been 
marked ;  for  the  first  test  alone  the  proportion  is 


Results  of  Vertical-Horizontal  Illusion  Test. 


Group 

Mean 

Eel. 
coefif.  rj 

Eel. 
coefif. 

Size 

M.  V. 

Size 

M.V. 

Size 

M.V. 

Size 

M.V. 

I 

II 
III 

25  +  4-6 
29  +  7-1 
31+8-2 

3-2  ±-11 
3-3  ±-11 

53  +  3 
65  +  5 
76  +  6 

1-3  ±-08 
1-0  ±-08 

•69 
•59 
•75 

•43 
[0] 

•82 
•74 
•86 

•60 
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(8)    Mechanical  Memory  Test 

In  this  test  a  printed  list  of  ten  nonsense  syllables  was 
placed  face  downwards  before  each  of  the  subjects,  and  at  a 
given  signal  the  subjects  turned  the  papers  over  and  applied 
themselves  to  the  learning  of  the  syllables  as  intently  as 
possible.  On  a  second  signal,  2—3  minutes  later,  the  papers 
were  once  more  turned  face  downwards,  and  collected  by  the 
experimenter.  The  subjects  were  then  asked  to  think  no  more 
about  the  syllables  for  the  present.  On  the  following  day,  at 
the  same  hour,  blank  slips  of  paper  were  distributed  and  the 
subjects  were  asked  to  write  down  the  syllables  they  had  learnt 
the  previous  day,  so  far  as  possible  in  the  right  order.  As  a 
system  of  marking  which  was  found  to  be  sufficiently  satisfactory 
for  the  purpose,  2  marks  were  given  for  each  syllable  right  and 
in  the  right  order,  and  1  mark  for  each  right  but  in  the  wrong 
order.  The  time  allowed  for  learning  was  3  minutes  in  the 
case  of  Groups  I  and  II,  but  this  was  found  to  be  too  long  in 
the  case  of  Groups  III  and  IV,  who  were  eventually  given 
2  minutes  and  2^  minutes  respectively.  On  account  of  the 
difficulty  thus  raised  (and,  in  the  case  of  Group  II,  for  another 
reason)  two  series  of  results  could  unfortunately  only  be 
obtained  from  Groups  I  and  IV. 


Results  of  Mechanical  Memory  Test. 


Group 

Mean 

Coefficient 
of 

Variation 

Reliability 
coefficient 

'•i 

Reliability 
coefficient 

'2 

I 

14-5 +  -8 

9-7 +  -57 

67  +  5 

•51 

•68 

*II 

9-6 +  -6 

5-7 +  -43 

59  +  6 

*III 

11-8  +  -8 

6-8 +  -53 

57  +  6 

IV 

31-5 +  -7 

7-7  ±-5 

24  +  2 

•50 

•67 

*  Results  of  one  test  only. 
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(9)  Memory  for  Poetry. 
This  test  was  applied  but  once,  and  to  Groups  I  and  III 
only.  Three  verses  of  Hood's  "  Queen  Mab,"  which  apparently 
neither  of  the  groups  had  seen  or  heard  of  before,  were  set  to 
be  learnt  for  5  minutes,  and  the  subjects  were  asked  to  attempt 
to  reproduce  them  24  hours  later.  The  frequency  constants 
were  found  to  be  as  follows : 


Group 

Mean 

Coefficient  of  Variation 

I 

III 

20  +  -85 
28-6  ±100 

9-7 +  -6 
9-0  ±-75 

49  +  3-7 
31 -6  ±2-9 

(10)    Combination  Test. 

This  was  the  well-known  Gombinations-Methode  of  Ebbing- 
haus,  in  which  the  subject  is  shown  a  passage  of  continuous 
prose  with  from  one-third  to  one-quarter  of  the  words  replaced 
by  blanks,  and  is  asked  to  supply  the  missing  words,  or  words 
of  similar  significance. 

In  applying  this  test,  a  thorough  explanation,  including 
black-board  demonstrations  and  examples,  was  first  given  to 
the  class  and  the  papers  were  then  distributed  face  downwards. 
At  a  given  signal  the  class  turned  the  papers  over  and  pro- 
ceeded to  read  the  passage  through  carefully  (writing  nothing) 
with  a  view  to  grasping  the  general  sense  of  the  entire  passage. 
On  a  second  signal,  three  minutes  later,  they  proceeded  to  fill 
in  the  blanks  in  order  from  the  beginning,  endeavouring  to 
find  in  each  case  a  word  which  would  suit  the  sense  both  of  the 
particular  sentence  in  which  it  occurred  and  also  of  the  entire 
passage.  This  second  period  lasted  five  minutes,  at  the  end  of 
which  time  the  signal  was  given  to  stop.  Such  was  the 
method  of  procedure  in  both  applications  of  the  test  in  the 
case  of  the  school  children,  and  both  results  were  found  to  be 
quite  satisfactory. 

In  the  case  of  the  adults  the  times  allowed  were  different, 
being  I'  +  IO'  for  the  first  passage,  and  l'  +  3'  for  the  second, 
and  the  reliability  coefficient  for  Group  V  was  found  to  be 
abnormally  low.    As  this  seemed  to  be  due  mainly  to  the 


no 
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unsatisfactory  way  in  which  the  first  test  was  performed  (I  had 
chosen  the  passage  badly),  the  results  of  the  second  test  were 
alone  used  for  the  purposes  of  correlation.  In  Group  IV  the 
reliability  coefficient  was  higher,  though  still  not  very  high, 
and  the  two  series  of  results  were  therefore  amalgamated  in  the 
usual  way. 

In  marking  the  papers,  words  supplied  by  the  subject  were 
counted  right  if  they  made  sense  in  their  sentence  and  tolerable 
sense  in  the  entire  passage,  and  Ebbinghaus'  system  of  values 
was  adopted ;  viz., 

Each  blank  filled  in  correctly  =  1  mark, 
„  „     „  incon-ectly  =  —  1  mark, 

„       „    passed  over  =  —  ^  mark. 


Results  of  Combination  Test. 


CoefScient 

Reliability 

Reliability 

Group  • 

Mean 

of 

coefficient 

coefficient 

Variation 

'•i 

'••2 

I 

22  +  1-3 

16-3+  -95 

74  +  6-3 

-74 

•85 

11 

19-7 +  1-2 

11-4+  -87 

.58  +  6 

•56 

•72 

III 

41-6  +  1-8 

17  +1-3 

41  +  3-5 

•73 

•84 

IV 

71-3 +  2-1 

24  +1-5 

33  +  3-3 

•46 

•63 

*V(«) 

18-5+  -78 

7-14+  -56 

38  +  3-4 

•22 

V(6) 

•69 

•82 

Measurements  11,  12,  and  13. 

Measurements  11  and  12  (marks  for  Drawing  and  Total 
School  Marks)  need  no  further  explanation.  The  grading  for 
General  Intelligence  was  obtained  from  two  of  the  schools — 
Groups  II  and  III— from  the  former  of  which  two  separate  and 
independent  gradings,  by  different  teachers,  were  provided. 
These  independent  gradings  correlated,  with  one  another  to  the 
extent  of  '90,  which  gave  a  reliability  coefficient  ?-o  for  the 
amalgamated  grading  =  "95  f. 
*  One  test  only  (the  2nd). 

t  The  high  value  of  this  coefficient  contradicts  the  views  of  certain  writers, 
but  is  borne  out  by  a  later  research  by  Miss  H.  Gertrude  Jones,  whose  results 
have  been  statistically  evaluated  by  Karl  Pearson,  Biomctrika,  Vol.  vii.  1910, 
pp.  642-548. 
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(14)  Association-time. 

This  test  was  applied  to  certain  individuals  of  Group  V 
(a  and  h)  only,  and  was  of  the  following  nature.  A  word  of 
ordinary  significance  (a  noun)  was  read  out  to  the  subjects  and 
they  were  expected  to  write  down  as  rapidly  as  possible,  during 
the  two  minutes  which  followed,  words  representing  the  various 
"ideas"  which  passed  through  their  mind  in  the  time.  After 
a  short  pause,  another  quite  different  word  was  called  out  and 
the  writing  repeated.  Finally  a  third  word  was  called  out. 
The  total  number  of  words  or  phrases  written  down  was  taken 
as  a  measure  of  the  rate  of  sequence  of  associated  ideas  in  the 
subject's  mind.    The  test  was  repeated  a  fortnight  later. 

This  method  gives  fairly  reliable  results — for  Group  Ya, 
ri  =  "67  and  for  Group  Yb,  =  '87 — but  is  vitiated  by  the 
mechanical  process  of  wi-iting.  The  impurity  could  be  elimi- 
nated by  applying  a  simple  writing  test  (speed)  also,  and  then 
employing  the  formula  for  the  "partial"  correlation  of  three  vari- 
ables ;  it  was  not,  however,  done  in  the  present  research. 

General  Remarks  upon  the  Tests. 

The  results  tabulated  in  the  last  few  pages,  when  tested  by 
means  of  the  formula  for  the  P.E.  of  a  difference  [a-x-y  —  '^a^-V 
and  .•.  p.E.a;_2/=  Vp.E.a;^  4-  P.E./],  show  certain  differences  between 
group  and  group  in  respect  of  average  ability,  variability  and 
reliability  for  correlation,  which  justify  our  plan  of  working 
correlation  coefficients  separately  for  the  several  groups,  but  do 
not  seem  otherwise  to  give  many  positive  results  of  general 
significance  and  importance,  such  as,  e.g.,  evidence  as  to  the 
relative  variability  of  the  two  sexes.  A  more  careful  and 
thorough  examination  of  the  tables  may  give  cause  for  some 
qualification  of  the  preceding  statement.  At  any  rate  the 
individual  figures  are  of  considerable  interest.  The  reliability 
coefficients,  even  for  the  single  tests,  are  in  most  cases 
sufficiently  high,— in  fact  much  higher  than  I  had  dared  to 
expect  considering  the  circumstances  that  in  all  but  one  test 
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the  subjects  were  examined  collectively.  The  less  satisfactory 
tests,  as  applied  in  this  research,  seem  to  be  those  for  accuracy 
of  addition,  bisection  and  trisection  of  lines,  m.v.  of  the  vertical- 
horizontal  illusion,  and,  in  a  slighter  degree,  mechanical  memory. 
The  combination  test,  in  the  case  of  adults,  was  the  most 
unsatisfactory,  but  in  the  case  of  the  school  children  it  gave 
fairly  high  results.  The  tests  in  which  two  applications  gave 
very  reliable  results  are  the  motor  test,  the  a  n  o  s  test,  speed 
of  addition,  the  combination  test  (with  children)  and  the  e  r 
test.  The  coefficients  of  variation  are  rather  high,  clustering 
about  the  values  20 — 30 ;  in  a  few  cases  they  are  considerably 
higher. 

The  tests  were  applied  during  the  course  of  the  summer  of 
1909,  and  my  sincere  gratitude  and  thanks  are  due  to  the  head 
masters,  head  mistresses  and  others,  through  whose  kindness 
I  was  enabled  to  bring  the  research  to  a  satisfactory  conclusion. 

As  a  fact  of  considerable  importance,  it  should  be  added 
that  the  tests  were  so  applied  as  to  disturb  the  ordinary  routine 
of  the  schools  as  little  as  possible. 


Correlation  Results. 

The  values  of  the  frequency  constants  for  the  various  groups 
of  subjects  show  very  clearly  that  any  plan  of  throwing  them 
together  (as  they  stand)  and  working  out  coefficients  from  the 
combined  series  would  produce  a  considerable  amount  of 
"spurious"  correlation  and  make  the  results  almost  valueless. 
One  exception,  indeed,  to  this  state  of  affairs  was  found  in  the 
case  of  Speed  and  Accuracy  of  Addition  [4  (a)  and  4  (b)]  in  Group  I 
and  in  a  group  of  20  boys  not  otherwise  included  in  the  present 
research.  The  means  and  s.D.'s  in  these  two  sets  of  boys  for 
these  two  characteristics  were  found  to  be  the  same,  within  the 
limits  of  probable  error.  These  86  boys  were  therefore  taken 
together  for  this  particular  correlation  and  a  correlation  table  was 
drawn  up  (see  Appendix  II  and  p.  59  above)  whereby  the  value 
of  V  could  be  calculated  as  well  as  that  of  r,  and  the  nature  of  the 
regression  curve  and  regression  line  determined.    To  make  the 
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investigation  into  this  particular  problem  of  speed  and  accuracy 
in  adding  complete,  small  correlation  tables  were  drawn  up  for  all 
the  other  groups,  separately  from  one  another,  and  the  value  of 
7/  was  calculated  in  each  of  these  cases  also  (Appendix  IV). 

Apart  from  these  cases  the  plan  of  grouping  in  correlation 
tables  seemed  quite  unsuitable  for  such  small  numbers.  The 
values  were  therefore  taken  as  they  stood,  but  the  full  product- 
moment  formula,  ,  was  employed  throughout,  with  the 
single  exception  of  Group  V  (6).    Here  the  numbers  were  so 

r       6<s'  (vi  — 

very  small  (23)  that  the  method  of  ranks  - 


2  sin  (J  p) 


was   considered  sufficiently  accurate,  since 


nothing  but  a  general  impression  of  the  nature  of  the  corre- 
lation could  be  expected. 

The  following  Tables  give  the  values  of  the  correlation 
coefficients  between  series  formed  by  the  amalgamation  of  the 
two  measurements  made  in  each  test.  The  numbers  immedi- 
ately below  the  coefficients  are  the  probable  errors,  and  those 
in  thick  type  are  the  reliability  coefficients  (r^)  for  the 
amalgamated  series,  showing  to  what  extent  each  amalgamated 
series  would  correlate  with  another  quite  similar  series.  In 
these  tables,  the  tests  are  arranged  according  to  order  of 
magnitude  of  the  average  correlation  of  each  with  all  the  rest 
(within  any  particular  group)  and  all  coefficients  smaller  than 
their  probable  errors  are  put  down  as  0.  Of  the  coefficients 
recorded,  the  total  number  of  those  >  2  P.E.  is  139,  and  of  those 
>  3  P.E.  the  total  is  86.  The  first  thing  to  be  noticed  in  the 
groups  of  coefficients  arranged  in  this  way  is  that  not  one  of 
them  shows  the  "hierarchical  arrangement,"  and  it  is  a  very 
significant  fact  that  the  group  which  approaches  it  most  nearly 
(Group  I)  is  the  group  where  "  spurious  correlation  "  due  to 
heterogeneity  of  material  was  to  be  suspected  (see  p.  99). 
Now  it  will  be  apparent,  on  the  slightest  reflection,  that  any 
extraneous  source  of  correlation  (such  as,  e.g.,  difference  of  the 
state  of  discipline  to  which  different  members  of  the  group  had 
been  accustomed  immediately  antecedent  to  the  occasion  of 

B.  8 
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GROUP  IV.    (56  ivomen  students.)  [Pi'ovisional 
and  incomplete  table  of  coefficients.] 


Combination 

Addition 

Addition 

Mechanical 

(accuracy) 

(speed) 

Memory 

Combination 

•63 

•53 

•34 

•31 

■06 

•08 

•OS 

Addition 

•53 

•45 

•43 

•20 

(accuracy) 

■06 

•07 

•09 

Addition 

•34 

•43 

•96 

•18 

(speed) 

•08 

■07 

■09 

Mechanical 

•31 

•20 

•18 

•67 

ilemory 

•OS 

■09 

■09 

Variability  coefficient  for  speed  of  addition : 
»Vi+sp.,  =  0-:33  ±^08. 

Sp.l~Sp.2 

Variability  coefficient  for  accuracy  of  addition  : 

^acc.,  +  acc.,  =  -0-66  ±  •05. 
acc.i~acc.2 


[Correlations  of  M.  L.  III.    (10  cases  only*.) 


''M.L.  111.  =  0, 
Comb. 

''M.L.  111.  =  -35  ±  -18, 
Mecli.  Mem. 

^M.L.  111.  =  - -57  ±  •14] 

M.V.  of  M.L.  111. 


'^M.V.  of  M.L.  111.  =  '28  +  -19, 
Comb. 

'"M.V.  of  M.L.  111.  =  --21  +  •20, 
Mecli.  Memory 


*  Owing  to  the  smallness  of  the  .sample,  the  p.e.'s  (calculated  bv  the 
ordmary  formula)  are  too  low,  and  the  results  cannot  be  re^arSd  as  anvth  n^ 
more  than  veiy  rough  approximations.  re^araea  as  anything 
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GROUP  V  (a).    (3o  omii  students.) 


Associa- 

bisection 

tion 

Addition  Combi- 

Addition 

M.L. 

+ 

er 

time 

(acc.) 

nation 

(speed) 

in. 

Trisection 

er 

•97 

-•18 

-•26 

•19 

0 

•42 

-•24 

•11 

•10 

•11 

•09 

•10 

Association 

-•18 

•87 

•39 

•33 

•37 

0 

0 

time 

•11 

■09 

•10 

•09 

Addition 

-•26 

•39 

•36 

-•16 

•38 

0 

0 

(accuracy) 

■10 

•09 

•11 

•09 

Combina- 

•19 

•33 

-•16 

•22* 

•19 

-•24 

0 

tion 

■11 

■10 

■11 

•11 

•10 

Addition 

0 

•37 

•38 

•19 

•97 

0 

•13 

(speed) 

■09 

•09 

•11 

•11 

M.  L.  111. 

•42 

0 

0 

-•24 

0 

•57 

-•29 

•09 

•10 

•10 

Bisection  + 

-•24 

0 

0 

0 

•13 

•29 

B  =  -53 

Tri  section 

•10 

•11 

•10 

T  =  -52 

9  coefficients  >  2  x  p.  e. 


5  coefficients  >  3  x  p.  E. 


*  The  second  test  only  was  used  in  this  case  for  correlation  with  other  tests 
since  the  low  correlation  between  the  two  was  almost  certainly  due  to  the 
unsatisfactory  nature  of  the  first.    The  value  -22  is,  then,  r, . 
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[GROUP  V  (b).    (23  luomen  students*.) 
{Method  of  ranks  used      p  =  1  -  ' 

r=2sm  p^.] 


Associa- 

Bisection 

Addition 

M.L. 

tion 

Combi- 

-1- 

Addition 

111. 

time 

nation 

Trisection 

(acc.) 

(speed) 

er 

M.  L.  111. 

•68 

-•78 

-•90 

-•84 

-  53 

-•66 

-•42 

■05 

•OS 

■04 

•10 

•08 

•11 

Association 

-•78 

•67 

•58 

•53 

•58 

•35 

■43 

time 

■05 

•09 

•10 

•09 

•12 

•11 

Combina- 

-•90 

•58 

•82 

0 

•28 

•13 

•52 

tion 

■03 

•09 

•IS 

■u 

•10 

Bisection  + 

-•84 

•53 

0 

B  =  ^61 

•29 

•40 

0 

Trisection 

•04 

■10 

T  =  ^93 

•13 

•12 

Addition 

-•53 

•58 

•28 

•29 

•74 

•35 

0 

(accuracy) 

■10 

■09 

•13 

■13 

•12 

Addition 

-•66 

•35 

•13 

•40 

•35 

•99 

0 

(speed) 

•08 

•12 

•u 

•12 

•12 

er 

-•42 

•43 

•52 

0 

0 

0 

•58] 

•11 

■11 

■10 

16  coefficients  >  2  x  p.  e. 


12  coefficients  >  3  X  p.  E. 


*  These  results  are  recorded  as  avowedly  rough  approximations  only,  owing 
to  the  smallness  of  the  sample. 
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applying  the  tests)  the  influence  of  which  is  in  a  constant 
direction  but  varies  in  amount  from  test  to  test  according  to 
the  varying  degrees  to  which  the  individual  tests  are  susceptible 
to  its  influence,  must  tend  to  produce  the  hierarchical  arrange- 
ment, and  unless  counteracted  by  other  more  potent  tendencies, 
will  do  so.  In  fact,  it  will  be  the  "central  factor"  supposed 
to  be  indicated  by  such  a  form  of  arrangement  of  coefficients. 
Spurious  correlation  of  this  nature  might  arise  from  the  use  of 
unfamiliar  apparatus  in  the  tests,  or  from  the  novelty  of  the 
tests,  or  in  many  other  ways.  The  form  of  procedure  adopted 
in  the  present  research  was  specially  devised  to  reduce  such 
extraneous  sources  of  correlation  to  a  minimum,  being  assimi- 
lated as  far  as  possible  to  the  ordinary  class- work  of  the  school. 

A  definite  solution  of  the  question  of  the  existence  or  non- 
existence of  one  central  mental  ability  is  yet  to  be  sought.  It 
can  only  be  obtained  by  the  use  of  much  larger  random  samples 
than  those  hitherto  employed,  since  the  probable  errors  must  be 
small  compared  with  the  coefficients,  if  precise  inferences  are  to 
be  drawn  from  the  latter,  and  in  the  case  of  small  samples,  this 
condition  is  satisfied  only  for  large  correlation  coefficients,  which 
when  obtained  are  often  merely  the  result  of  selecting  tests 
which  measure  closely  similar  mental  abilities.  In  all  results 
hitherto  quoted  in  support  of  ultimate  identity  of  general 
intelligence  and  general  sensory  discrimination,  the  correlations 
contributed  by  the  latter  are  so  small  compared  with  their  p.e.'s 
that  nothing  definite  can  be  inferred  from  them.  On  the  other 
hand,  in  such  cases  it  is  easy  to  propound  hypotheses  since  the 
bounds  of  possibility  are  nowhere  limited  in  any  unambiguous 
way. 

Certain  sub-groups  can  be  chosen  from  the  tables  so  as  to 
show  a  hierarchical  arrangement,  e.g.  Group  III,  School  Marks, 
General  Intelligence,  Mechanical  Memory,  and  Combination. 
In  fact  the  general  law, — so  far  as  the  results  allow  of  the  con- 
fident formulation  of  any  law  at  all — would  seem  to  be  that  the 
tests  fall  into  a  number  of  such  sub-groups,  correlating  highly 
among  themselves,  but  not  at  all  highly  with  members  of  other 
sub-groups,  though  an  individual  member  of  one  sub-group  may, 
exceptionally,  correlate  highly  with  an  individual  of  another 
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Eank 

II 

Eank 

III 

Eank 

1 

Combination 

3 

Combination 

3 

2 

Mech.  Memory 

4 

er 

1 

3 

an  OS 

2 

Addition  (sp.) 

5 

4 

e  r 

1 

a  n  0  s 

2 

5 

Motor 

7 

Mech.  Memory 

4 

6 

Addition  (sp.) 

5 

V.  H.  111. 

8 

7 

V.  H.  III. 

8 

Motor 

7 

8 

Addition  (acc.) 

6 

Addition  (acc.) 

6 

sub-group.  In  order  to  bring  out  these  relations  more  clearly 
and  also  to  show  the  relation  of  the  main  groups  with  one 
another,  the  table  on  pages  122  and  123  was  drawn  up.  The 
results  there  to  some  extent  explain  themselves.  Differences 
in  correlation  between  the  two  sexes,  though  well  marked,  do 
not  seem  to  follow  any  general  law.  On  the  whole,  the  correla- 
tions are  lower*  in  the  girls  than  in  the  boys,  higher  in  the 
women  than  in  the  men.  A  striking  feature  is  the  fairly  large 
jiumber  of  instances  of  negative  correlation  in  the  girls  corre- 
sponding to  positive  correlations  in  the  boys. 

Comparing  relative  order  of  tests  in  the  tables  for  Groups 
I,  II,  and  III,  I  get : 
I 

e  r 

an  o  s 

Combination 
Mech.  Memory 
Addition  (sp.) 
Addition  (acc.) 
Motor 
V.  H.  111. 

The  rank-correlations  here  are 

Ph  II  =  '66,  pi^  III  =  -74,  pn.  III  =  "55. 

As  might  be  expected,  the  two  groups  of  boys  correspond 
more  closely  with  one  another  than  either  does  with  the  group 
of  girls.  (Compare  value  deduced  from  Mr  Burt's  results, 
p.  93  above.) 

The  method  of  multiple  or  "partial"  correlation  (see  pp.  52, 
63,64  above)  may  be  very  advantageously  employed  to  investigate 
the  way  in  which  the  correlation  coefficients  are  related  to  one 
another.  Thus,  taking  the  three  variables,  er,  anos,  and  motor, 
and  using  the  formula 

1  get  the  following  values  for  the  partial  correlations  in  the  first 
three  groups : 

Group  I  Group  II  Group  III 

er,  anos  -80  + -03  -82 ±-04  -76 +  -04 

er,  motor  .59±-Of,  .55 +  .03  .37 ^.qq 

anos,  motor       ^    -•38±-07  --SOt-OO  -■2S±-\0 

*  This  may  be  duo  to  greater  variability. 


a 

(U 

a 

 s-*^ 


a 


+1  +1  +1  +1  I 

CO  00  c» 


CO 


o  o 


l-H  !— (  i-H  O 
1— 4  I— I  1— I 

+1  +1  +1  +1 

05  05  CO  CO 

(-H  i-H  1 — I  CO 


^co 


I 


+1 


o 

+1 

00 


3 
O 

o 


a 

o  fl 


O  H 

n 

HI 


00 

p 

+1 


00  CO 

o  o 

+1  +1 

^  CO 

e<5  lo 


+1  +1 

00  o 


CD  05  O  05  1— I  O 

Cp  p  p  r— I  p  r— I 

+1  +1  +1  +1  +1  +l'o +10" 


05  00  05 

000 


O  Oi  CD  00  -*  O 
CO  CO  -ft  (N  .-H 


•  ■•III'      "      ■      "  ■ 

+1  +1  +lp  o  o  o 

O  O  00 
^  C<J 


lO  O  t— ' 
O  I— I 

+1  +1  +1 

-t  O  r-l 
I;-  r-l 


10 
PO 


P 
+1 

CO 

'J* 


+10  +1 


CO 


.a  ii 


0005005  1— II— 10  t-00 

pOi— 10  I— it^i— I  00 

+1  +1  +1  +1  i  g+l  +1  +1  I        +1  ihl 

'*00tMt~  lOCCiO  05iO 

ip  ^  Oi  Cpi  T^T''?'  'P'P 
.     .  ^. 


+lp,+|  +1 

CO  CO 

I— I  (M 


+100. 


O 


o 

00 


+10 

CO 


I 


+1 

CM 


a 


(U 


CO  CO 

p  p  p  p  p  p 

+1  +1  +1  +1  +1  +1 

CN  00  lO  O  00 
lO  lO       ^  ^  CO 


GO        00  00 

00000 


CO  CD  00 

000 


O  00 

o  o 


I  +1 +1+1 +1+1 

05  05  O  i-( 

Tt<  CN  ^  cq  CO 


+1  +1  +1   +1  +1 


00  r-l 

r~  lO  oj 


C5  O 

in  CO 


n3 


01 

o 


S  O 


a  g 


a) 


03 

5  J 


r;        CO  o  fH 

S  «  a 

^  i3  CD 


a  13 

00  o  o 


T3 

ce 
o 

!S  to 


a  ^ 

bO  o 

c  .0 


00 
p 

+1 

CO 


s  8 


O 

c 

O  m 
O  r==H  1-^ 


o 

«  ^ 

2  s 

«^  c  fi  2 

73  o  P  - , 


03 


«  a 
£•2 


o  s- 
13  ■» 


o  a  o 
^3  ^  ™  Sh 

&10  §  o 
to  eS  S  =S 

G  C-^-C 
000 

^ 


00 

o 


+1  I  +1 


CD 
CO 


05 
O 

+1  IS 


+IO  +lp, 


o 


+1 


o  o    o  o  o  o 

I— I  I-H  r-H  I-H  ! — I  I— I 

•fl+lg+l  +1  +1  +1 

o  OS     CM  -H  in 

M  CO  «  CO 

'l  '   "l  'l  I 


o     o  o  o  ^ 

I-H  I — I  I — 1  I-H  I-H 

o^+l  +1  +1  +1  +1 
'   '-^     CO  o  o  ^ 

rH         CO  CO  ^0  ry' 
"       I      I  I 


+  1 

CO 


o 

+1 

I-H 

'l 


00  o 

O 

+  1+1 

O  C<1 


00  03  05  C5 
p  p  p  p 

+1  +1  +1  +1 

I-H  oq  ^  OJ 
»0  ^  ^  CO 


00  o  o  o 

P  -7"'  rH  7H 

+1+1  +1  +1 

C5  rH  ro  o 

-*  (N  CO  CO 


Oi  o  o 

O  7H  I-H 

+1+1  +1 

^  I-H  I-H 

qq  (jq 
'1  '1  '1 


I 


O  rH 


O  05 
rH  O 


+1  +1  +1        +1  +1 


O  rH  !M 
(M  01  rH 


rH  O 


rH  t^q 


+1 

CO 


o  o 


CO  00  CO 

pop 

+1  +1  +10] 

CO  rH  O 

lO  fM  (M 


CO 
+1 

rH 


00 

p 

+1 

rH 

n 
I 


*  S 

M   OJ  o 

o  a,  o 


T3 

a 


2  ^  ^ 

-1  K* 


Q  O 


23  £.2.2 

o  aj  © 

' — ' 


;h  . 


O 

c 


a 


-l->  -u 

HJ  -u 
Uj  UH   1"  u 


0) 

— -o 

o  ce ; 
o  I 


93 

o 
C 

aj 

03  ^ 

£  a 
!3  P.2 

1— I  I-H  +2 

-  g-S 

O   'h  o 

o  5^ 
a  c 

o  >H  o 


c 


tcj  a 


)A 


T3 

a 

•  S  9 


O   >l  Si 

o  is  2 

1=1  P-l  '3! 

aj  1^ 

+^  ;h  , 

a  o 

■     ■  t+H 

o 


•s:  2 


be 


a  a  _ 

S  a*  <u 


rd 
o 


124 


SOME  EXPERIMENTAL  RESULTS 


Thus  the  original  positive  correlation  between  anos  and 
motor  is  due  entirely  to  the  correlation  of  each  with  er.  For 
"er  constant,"  the  correlation  is  large  but  negative,  in  all  three 
cases. 

Employing  the  four  variables.  School  Marks,  General  In- 
telligence, Combination  and  Mechanical  Memory  (Group  II) 
and  using  the  formula 

  ^"12  (1       ^'^34)  ~  ''^13  (^'23       ^24^^34)  ~  ^"t4  (^24  ^23^34)  

?*]2  34  —      .  ~         —   ,  ■  '■  t 

V 1  -  r^,  -  ?    -  r^34 + 27^13       V 1  -  r\,  -r\^-  r'^ + 2    r.^  r.^ 

(see  p.  64),  I  find  the  correlation  between  Combination  and 
General  Intelligence,  assuming  constant  ability  in  Mechanical 
Memory,  and  as  shown  by  School  Marks 

=  0-11  ±0-11. 

The  "  entire  "  coefficients  =  0-43  +  '09.  It  is  interesting  to 
note  that  there  is  still  correlation,  though  very  slight,  after  the 
effect  of  memory  and  school  industry  and  abihty  is  eliminated. 

A  more  thoroughgoing  application  of  the  method  of  partial 
correlation  was  made  in  the  case  of  Group  I,  for  the  tests  er, 
anos,  Combination  and  Mechanical  Memory.  The  full  working, 
according  to  Mr  G.  U.  Yule's  method,  is  given  in  Appendix  III. 
The  results  were : 

Tests  Correlated 

e  r,  ca  11 0  s 
er,  Combination 
er,  Mechanical  Memory 
anos,  Combination 
anos,  Mechanical  Memory 
Combination,  Mechanical  Memory 

The  regression  equation  for  the  calculation  of  ability  in 
combination  from  abilities  in  the  other  three  tests  is 

Wf.  =  •002it;er  +  'CQQiranos  +  'TOS/CHicch.  memory. 

X  in  each  case  denoting  deviation  from  mean  ability. 

For  a  larger  number  of  variables  than  four,  the  arithmetic 
of  partial  correlation  becomes  extremely  lengthy  and  rather 
fatiguing,  but  there  can  be  no  doubt  whatever  that  this  is  the 
one  sound  method  to  adopt  in  investigatuig  the  relations 


Entire 

Partial 

Coefficients 

Coefficients 

0-78  ±-03 

0-73  ±-04 

0-45  ±-07 

0-01  ±-09 

0-40+ -07 

0-26  ±-08 

0-48  ±-07 

0-27  ±  -08 

0-29  ±  -08 

-0-15  ±-08 

0-52  ±-06 

0-44  ±-07 
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between  coefficients.  Working  with  a  large  number  of  variables 
is  only  satisfactory  when  the  original  coefficients  are  large  com- 
pared with  the  p.E.'s,  since,  as  a  rule  (not  universally*)  the  partial 
coefficient  is  smaller  than  the  entire  coefficient.  The  formula 
for  the  r.E,  is  the  same  in  both  cases. 

It  must  also  be  remembered  that  the  partial  correlation 
formulae  assume  linear  regression.  In  the  case  of  the  correla- 
tion of  Addition  (speed)  and  Addition  (accuracy)  throughout 
the  Groups,  and  in  the  case  of  Combination  and  Mechanical 
Memory  in  one  Group  (Group  I),  the  "correlation  ratio"  {n)  was 
determined  and  compared  with  the  corresponding  value  for  r. 
The  results  are  given  in  Appendix  IV,  together  with  the 
regression  curves.  It  will  be  observed  that  although  the  criterion 

(  '^-^  .  '{jTjzll]  works  out  <  2"5,  there  seems  to  be  a  small 
V67449      2  / 

but  well-defined  deviation  from  linearity  in  all  the  cases  of 
Addition  (sp.  and  ace),  (see,  too,  the  regression  curve  in 
Appendix  II).  r}  is  in  all  cases  fairly  high,  and  the  general 
form  of  curve  is  approximately  identical  in  all.  With  increase 
of  speed,  the  accuracy  may  at  first  decrease  (Groups  III  and 
VI),  but  after  that,  it  in  all  cases  increases,  then  decreases,  then 
increases  once  more,  and  in  cases  III  and  V  (a)  it  finally 
decreases. 

Variabihty  coefficients  were  obtained  for  Speed  of 

Addition  and  Accuracy  of  Addition  in  Group  IV,  and  were 
found  to  be 

0-33  ±  -08  and  -  0-66  +  '05,  respectively. 

A  full  analysis  of  the  entire  data  in  this  and  other  additional 
ways  must  be  reserved  for  a  later  occasion. 

As  mentioned  above,  pp.  84  and  85,  tests  were  made  of  the 
applicability  of  Spearman's  correction  formula,  with  results 
which  precluded  the  use  of  the  formula.  The  question  of 
correlation  of  errors  of  measurement  with  the  true  values  of  the 
variates  and  with  one  another  has  been  made  the  subject  of  a 
separate  piece  of  research  by  the  present  writer.  The  results 
of  this  investigation  will  be  published  shortly. 

*  See,  e.g.,  p.  52  above. 
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In  the  case  of  the  Vertical-Horizontal  Illusion  Test,  it  is 
perhaps  of  interest  to  note  that  if  subjects  showing  a  negative 
value  of  the  illusion  are  excluded  the  value  of  the  correlation 
between  this  Illusion  Test  and  the  Combination  Test  becomes 
positive  and  appreciable— in  Group  IV 

^Combiimtiou  =  0-24  ±  O'OQ, 
\.  H.  111. 

(one  test  only  of  V.H.  111.),  in  Groups  I  +  III,  it  =  0-26  ±  0-08 
(this  latter  value  probably  includes  some  "spurious"  correlation: 
see  tables  pp.  107,  110).  If  negative  values  are  included,  we 
have  the  results : 

^Combination  • 
V.  H.  111. 

Group  I,  0 ;  Group  II,  0-22  ±  10 ;  Group  III,  -  -11  +  11. 

^Gen.  Intelligence  • 
V.  H.  111. 

Group  I,  0 ;   Group  II,  0 ;   Group  III,  -  -30  ±  10. 
Conclusions. 

1.  The  correlation  between  different  psychical  abilities  is 
not  very  close.    Few  coefficients  are  greater  than  '6. 

2.  The  size  of  the  correlation  coefficient  varies  greatly 
from  one  group  of  subjects  to  another.  This  shows  how  great 
is  the  danger  of  spurious  correlation,  due  to  heterogeneity  of 
material,  in  psychical  measurements. 

3.  The  Combinations-Methode  of  Ebbinghaus  is  a  good 
measure  of  intellectual  ability.  It  correlates  with  "general 
intelligence "  almost  as  closely  as  "  scholastic  intelligence " 
(school  marks)  does. 

4.  Mechanical  memory  correlates  fairly  closely  with  in- 
telligence. 

5.  Drawing  also  correlates  closely  with  intelligence  in  the 
case  of  higher  grade  school  boys.  On  the  other  hand,  for  girls 
of  the  same  age  the  correlation  is  nil. 

6.  The  susceptibility  to  the  Muller-Lyer  Illusion  is  not 
at  all  closely  correlated  with  that  to  the  Vertical-Horizontal 
Illusion.  This  indicates  that  the  factors  involved  are  for  the 
most  part  different  in  the  two  cases. 
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7.  The  "mean  variation"  of  the  Vertical-Horizontal  Illusion 
is  positively  related  to  the  size  of  the  illusion  (Group  I,  '21  ±  '08, 
Group  III,  -33  +  -10). 

8.  In  the  case  of  the  Muller-Lyer  Illusion,  there  is  a  zero 
or  slightly  negative  correlation  between  M.v.  and  size. 

9.  The  two  optical  illusions  correlate  differently  with 
drawing,  the  Miiller-Lyer  negatively  (Group  II,  -  '44  +  "09, 
Group  III,  -  -19  +  -10),  the  Vertical-Horizontal  positively  or 
not  at  all  (Group  II,  '27  ±  "lO,  Group  III,  0). 

10.  The  mean  variations  of  the  illusions  show  a  zero  or 
very  slight  negative  correlation  with  other  measures  of  mental 
ability. 

11.  The  correlation  between  speed  and  accuracy  of  mental 
performance  is  slightly  non-linear. 

12.  Correlations  may  be  very  low  even  within  a  set  of 
mental  tests  which  appear  to  measure  closely  related  mental 
abilities,  and  this  when  the  reliability  coefficients  are  high. 
Thus,  the  correlation  between  erasing  the  letters  a,  n,  o,  s  and 
erasing  all  the  letters,  is  less  than  three  times  the  probable 
error  in  every  group  tested.  For  "erasing  the  letters  e,  r" 
constant,  the  partial  correlation  is  negative  and  numerically 
greater  than  3  p.e.  in  each  group. 

13.  In  homogeneous  groups  of  subjects  there  is  no  positive 
evidence  of  the  existence  of  one  "  central  factor  "  to  which  the 
correlations  between  the  individual  mental  abilities  may  be 
regarded  as  due. 

14.  There  is  in  several  cases  a  pronounced  correlation 
between  ability  and  variability.  This  may  be  either  positive 
or  negative.  It  is,  e.g.,  positive  for  speed  in  adding  figures, 
negative  for  accuracy. 

15.  There  is  definite  evidence  that  variability,  as  measured 
by  the  coefficient  of  variation,  is  greater  in  boys  than  in  girls. 
Differences  in  correlation,  though  sometimes  appreciable,  seem 
to  follow  no  well-defined  rule. 


CHAPTER  IV 


THE  SIGNIFICANCE  OF  CORRELATION  IN  PSYCHOLOGY 

The  full  significance  of  correlation  in  Psychology  is  to  be 
found  in  the  general  theory  of  multiple  correlation,  of  which 
the  correlation  of  two  variables,  measured  by  the  correlation 
coefficient,  is  only  a  particular  case.  Karl  Pearson's  classic 
deduction  of  the  product-moment  formula*  proceeds  on  these 
lines.  We  will  therefore  attempt  to  state  the  general  problem, 
so  that  we  may  have  something  tangible  on  which  to  base  our 
discussion. 

Let  there  be  n  physical  (or  psychical)  quantities 

Xi,  OC2,  OSg  ...  , 

which  we  observe  and  find  correlated  together;  and  let  us 
assume  that  there  are  vi  causes 

which  may  be  physical,  biological,  or  psychological,  but  which 
are  absolutely  independent  of  one  another.  We  may  further 
assume  that  the  xs  are  given  by  functions  of  the  e's,  thus : 

X2  =  f-1  (ei,  fo,  ey  ...  e,„). 


Xn—fn  i^i,        63  ...  6„i). 

All  the  e's  need  not  occur  in  each  aj-function. 

*  "Regression,  Heredity  and  Panmixia,"  Phil.  Trans.  A.  Vol.  cxcviii. 
pp.  443-459.  The  following  proof  is  based  partly  upon  a  lecture  demonstration 
given  by  Prof.  Pearson. 
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We  further  assume  that  m  >  n,  since  otherwise  the  m  e's 
can  be  eliminated  from  these  n  equations. 

Then  our  general  problem  is  this :  supposing  the  independent 
variables,  e,  each  follow  a  Gaussian  distribution,  what  will  be 
the  frequency  surface  connecting  the  xs  ? 

Assume  that  the  variations  of  e  are  small  relative  to  their 
mean  values.  In  this  case  we  may  expand  the  functions  and 
retain  only  the  first  powers,  in  other  words  we  may  make  the 
a;'s  linear  functions  of  the  e's. 

Let  I's  be  deviations  of  x's  from  mean  values, 
and  Tj's  „  „       „  e's  „ 

Then  =  anVi  +  ch2V2  +  . . .  +  a^rnVm] 

^2  =  0^21 ''71  +  Cf22^2  +  . . .  +  a^inVm  , . . 


nm  ''7m  / 

By  hypothesis,  the  77's  are  independent  and  follow  the 
Gaussian  distribution.  Hence 

the  chance  of  a  complex  of  variables  lying  between 

Vi,  Vi  +      =  Ci  X  e  ^'^'dvi, 
the  chance  of  a  complex  of  variables  lying  between 


the  chance  of  a  complex  of  variables  lying  between 


Vm,  Vm  +  ^V7n  =  C,n  X  6   2«m  dT}r. 

Whole  chance  =  product  of  these. 

We  can  now  eliminate  one  of  the  ?7,'s  in  the  following 
manner. 

Let  us  solve  the  n  equations  (i),  finding  the  value  of 

Vi,V2'--Vn 

in  terms  of 

^1  —  (hn+iVn+i  —  0,in+2Vn+2  "...  —  ai^Vm, 
^3  —  0,m+iVn+i  -  a2,i+2'»7n+2  —  ...  -  Cl2m'7?>i, 


?n  -  a 


»in+i'7«H-i  ~  0.)in+2''7n+2  —  ...  —  anmVi 
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Since  these  equations  are  linear,  our  chance  of  a  complex 
of  variables  lying  between 

and  f  1  +     ,  ^,  and  ^,  +  8^,,...^^  and  f  „  +  g^,, 
Vn+i  and  Vn+i  +  8r)n+u  ...  r),n  and  Vvi  +  8r],n 
will  be  of  the  form 

Const.  X  e-k.-ka-kax  BxM.  ...       X  S77„+,  ...  S7;^...(ii). 
In  this  expression      is  a  function  of  the  f's  only,  q.^  is  a 
function  of  rjn  to  t;^  only,  and  q^  is  a  function  of  the  products 
of  ^'s  into  7}'s. 

In  order  to  find  the  chance  that  our  complex  of  variables 
lies  between  ^j,  fi+S^i  up  to  ^n,  ^n  +  8^n,  we  must  integrate 
expression  (ii)  for  all  values  of  77^+1  up  to  ijm  between  the 
limits  +  00  and  —  00  . 

Integrating  with  regard  to  77,1^.1,  we  find  that  the  expression 
keeps  its  general  form,  only  rjn+i  has  disappeared.  If  now  the 
process  be  repeated  with  77^+2,  the  integration  will  again  leave 
the  expression  of  the  same  form  except  that  77,1+2  will  have 
disappeared.  Continuing  this  for  an  indefinite  number  of 
77  integrations,  all  the  77's  will  at  length  disappear,  and  we 
shall  be  left  with  a  quadratic  function  of  ^'s  only.  In  other 
w^ords,  the  chance  of  a  complex  of  variables  lying  between 

and  ^i  +  S^i,  ^2  and  ^2  +  ^?2,  •••      and  ^n  +  S^n 
=  Const.  X  -  +P«»f«'+2^.af.f2+2(7.af.f3+  ■■■)d^4^,  ...d^n--  -(iii). 

This  expression  represents  the  general  form  of  the  frequency 
surface  connecting  the  x's. 

It  may  be  noted  that  the  fi*equency  will  be  constant  when 
9u^i'+922^2'+  ■  ■  ■  +^nn^n'+25'i2^i?2+2sri3?i^3  +  •  •  •  =const.. .  .(iv). 
This  represents  an  ellipsoid  in  n-dimensional  space. 
Taking  the  simplest  case  of  two  variables,  expression  (iii) 
becomes 

Oe-i(fi'.f.""+'?»f»'+2f d^,d^^   (v). 

Integrating  (v)  for  all  values  of  from  -  00  to  +  00 ,  we 
get  the  Gaussian  curve  of      variation,  so  that 

1  _    ^  9^l\ 
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Similarly,  integrating  for  all  values  of 

If  now  we  integrate  for  all  values  of  and  we  get  the 
total  frequency,  N : 

Ctt 

N  = 


^gig-^-gi'' 

Writing  a;  for  fi,  y  for      and  r  for  -       ,  we  have  as  the 
equation  to  the  correlation  surface 

N   (VI). 

^  27r(riO-2\/l-r^ 
?•  in  expression  (vi)  is  the  coefficient  of  correlation.    That  it  is 
a  measure  of  correlation  is  apparent  from  the  fact  that  with 
no  correlation  the  compound  probability  for  two  variables, 

whose  separate  variations  are  represented  by  e  2<Tf  and  e  ao-^-, 

_i/5i  ,  ^\  2rxy  . 

would  be  proportional  to  e  -^W  The  term  in  the 

index  marks  the  difference  between  the  two  expressions. 

(•+00  r+oo 

It  can  be  shown  that  S(xy),  or  zxydxdy,  is  equal 

J  —ao  J  —  00 

to  Nra-iO-^. 

S  (xy)  ,  ... 

The  correlation  surface  represented  by  equation  (vi)  is  in 
three  dimensions  of  space*,  and  is  such  that  vertical  sections 
parallel  to  the  two  axes  of  coordinates  are  all  noi'mal  curves 
and  horizontal  sections  (i.e.  the  "  contour  lines  ")  are  similar  and 
similarly  situated  ellipses.  With  respect  to  these  ellipses  the 
two  regression  lines  are  conjugates  to  the  axes  of  x  and  y 
respectively. 

Other  properties  of  the  surface  are : 

(1)  The  means  of  parallel  arrays  all  lie  on  a  straight  line, 
i.e.  the  regression  is  linear. 

(2)  The  standard  deviations  of  all  parallel  arrays  are  equal. 
*  See  Fig.  29  in  G.  Udny  Yule's  Theory  of  Statistics,  p.  166. 

9—2 
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It  should  be  noted  that  although  normal  distribution  of 
the  two  variates  involves  linear  regression,  the  converse  is  not 
always  true.  For  this  reason  the  proof  of  the  product-moment 
formula  in  Chap.  I  is  more  general  than  that  given  in  the  present 
chapter ;  it  assumes  linear  regression,  but  is  quite  independent 
of  the  form  of  distribution.  Similarly,  the  formulae  for  partial 
regression  follow  from  the  proof  of  the  present  chapter,  but 
they  can  also  be  obtained  more  generally  on  the  assumption 
merely  of  linear  regression,  independent  of  form  of  distribution 
(see  Chap.  I,  pp.  59,  63). 

By  means  of  the  formulae  for  multiple  or  partial  correlation 
we  are  able  to  deal  with  more  than  two  variables,  and  to 
determine  the  relation  between  any  two  in  independence*  of 
the  influence  of  all  the  rest.    Hence  in  psychology,  where  the 
problem  of  analysis  is  one  of  the  most  important,  the  theory 
of  partial  correlation  is  of  the  utmost  value.    By  its  means 
the  mental  abilities  of  a  homogeneous  group  of  individuals  can 
be  analysed  out  and  arranged  in  order  of  closeness  of  inter- 
relation to  one  another;  in  other  words  the  abilities  of  the 
group  may  be  exhibited  as  a  system,  quantitatively  determined 
in  its  various  parts.    We  must  remember,  however,  that  the 
formulae  are  only  applicable  in  cases  of  linear  regression. 
Evidence  for  this  should  therefore  be  forthcoming  in  all  future 
researches  aiming  at  completeness ;  r  should  be  tested  by  rj. 

Logicians  remind  us  that  Mill's  "  Method  of  Concomitant 
Variations  "  is  to  be  regarded  not  as  a  method  of  scientific  proof 
but  as  a  fruitful  source  of  suggestions  for  scientific  hypothesis. 
So  it  is  with  correlation.  A  coiTelation  coefficient  is  a  state- 
ment of  probabilities.  It  does  not  prove  anything.  It  merely 
suggests  a  hypothesis  as  regards  causation  within  a  particular 
sphere.  The  investigator  must  go  beyond  the  coefficient  to 
the  facts  themselves  for  his  scientific  hypothesis.  In  the  light 
of  this  the  coefficient  gains  new  significance  and,  if  carefully 
and  accurately  calculated,  is  now  of  absolute  worth  as  a 

*  It  is  important  to  understand  exactly  in  what  sense  this  statement  is 
correct.  We  do  not  eliminate  the  other  variables  when  we  employ  the  formula 
for  partial  correlation  ;  we  only  make  them  constant,  i.e.  we  eliminate  the  effect 
of  their  variation. 
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quantitative  measure  of  a  concomitance  of  variation  explained 
by  a  reference  to  more  ultimate  factors.  For  reasons  already 
set  out  in  earlier  chapters,  a  true  correlation  coefficient  is  not 
easy  to  obtain  in  psychology.  Hence  it  is  probable  that  much 
of  the  statistical  work  done  by  psychologists  during  the  next 
few  years  will  be  limited  to  an  effort  to  obtain  more  reliable 
values  for  their  correlation  coefficients.  At  the  present  time 
coefficients  obtained  by  diflferent  people  and  even  by  the  same 
people  on  different  occasions  and  when  working  with  different 
groups  of  subjects  show  a  great  lack  of  agreement  with  one 
another.  So  far  as  this  is  due  to  differences  in  the  "irrelevant" 
conditions  of  the  investigations  a  careful  statement  and  analysis 
of  these  conditions  and  a  comparison  of  those  in  one  investiga- 
tion with  those  in  another  are  likely  to  explain  the  discrepancies 
and  so  turn  them  to  good  account.  From  the  nature  of  the 
case,  absolute  agreement  is  not  to  be  expected.  If  the  dis- 
crepancies stimulate  the  investigator  to  go  behind  the  coefficients 
to  the  facts  themselves,  the  cause  of  science  will  be  better 
served  than  if  a  closer  agreement  had  been  originally  attained. 

It  is  well  to  bear  constantly  in  mind  that  a  correlation 
between  two  mental  capacities  does  not  always  and  necessarily 
indicate  identity  or  partial  identity  of  certain  part-capacities ; 
it  may  be  due  to  identity  or  partial  identity  of  environment 
(taken  in  its  widest  sense).  Special  teaching  may  produce  a 
correlation  between  two  school-subjects  which  would  not  be 
found  under  ordinary  conditions.  Again,  response  to  a  common 
environment  may  have  produced,  in  the  course  of  evolution,  a 
correlation  between  two  mental  functions  introspectively  quite 
dissimilar  to  one  another. 

Partial  correlation  will  carry  us  far  in  the  unravelling  of 
the  tangled  complex  of  mental  ability.  It  will  also  serve  to 
give  quantitative  precision  to  results  when  classified  in  the 
light  of  an  explanatory  hypothesis.  For  the  production  of  the 
hypothesis  itself  we  must  look  elsewhere,  viz.  to  psychological 
analysis  and  psychological  insight.  . 
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Fechners  Fundamental  Table. 


11 

t  =  7i  5 

n 

t  =  hS 

11 

t-hS 

—  

0-50 

0-0000 

0-67 

0-3111 

0-84 

0-7031* 

U  0  i 

U  Ul  /  / 

U  DO 

U  ooUi 

U  OO 

u  /  ozy 

0-52 

0-0355 

0-69 

0-3506 

0-86 

0-7639 

0-53 

0-0532 

0-70 

0-3708 

0-87 

0-7965 

0-54 

0-0710 

0-71 

0-3913 

0-88 

0-8308 

0-55 

0-0888* 

0-72 

0-4121 

0-89 

0-8673 

0-56 

0-1067* 

0-73 

0-4333 

0-90 

0-9062 

0-57 

0-]  247 

0-74 

0-4549 

0-91 

0-9480* 

0-58 

0-1427* 

0-75 

0-4769 

0-92 

0-9935* 

0-59 

0-1609 

0-76 

0-4994 

0-93 

1-0435* 

0-60 

0-1792* 

0-77 

0-5224 

0-94 

1-0993* 

0-61 

0-1975 

0-78 

0-5460 

0-95 

1-1630* 

0-62 

0-2160 

0-79 

0-5702 

0-96 

1-2380* 

0-63 

0-2346* 

0-80 

0-5951 

0-97 

1-3300* 

0-64 

0-2535 

0-81 

0-6208 

0-98 

1-4520* 

0-65 

0-2725 

0-82 

0-6743 

0-99 

1-6450 

0-66 

0-2916* 

0-83 

0-6747 

1-00 

If  n  is  <  0-5,  look  in  the  Table  not  for  n  but  for  1  -  n,  and 
take  t  negative.    Thus  the  tiorn  =  0-25  is  -  0-4769. 

*  The  first  two  tables  in  this  appendix  are  quoted  from  E.  B.  Titchener, 
Experimental  Psychologij,  Vol.  ii.  Student's  Manual,  pp.  99  and  101.  The 
"  starred"  values  in  the  first  table  are  the  corrected  values  given  by  H.  Bruns 
(Wahrscheinlichkeitsrechnung  und  Kollektivmasslehre,  1906),  and  quoted  by 
F.  M.  Urban  [Psychological  Review,  Vol.  xvii.  1910,  p.  251).  Sheppard's  Tables 
of  the  Probability  Integral  (Biometrika,  Vol.  ii.)  are  more  generaUy  useful  than 
this  one  of  Fechner's. 
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Muller's  Table  of  Coefficients  of  Weights. 


n 

w" 

n 

w" 

n 

id" 

0-50 

1-000 

0-67 

0-824 

0-84 

0-373 

0-51 

0-999 

0-68 

0-803 

0-85 

0-342 

0-52 

0-997 

0-69 

0-782 

0-86 

0-311 

0-53 

0-994 

0-70 

0-760 

0-87 

0-281 

0-54 

0-990 

0-71 

0-737 

0-88 

0-251 

0-55 

0-984 

0-72 

0-712 

0-89 

0-222 

0-56 

0-977 

0-73 

0-687 

0-90 

0-193 

0-57 

0-969 

0-74 

0-661 

0-91 

0-166 

0-58 

0-960 

0-75 

0-634 

0-92 

0-139 

0-59 

0-950 

0-76 

0-606 

0-93 

0-114 

0-60 

0-938 

0-77 

0-578 

0-94 

0-089 

0-61 

0-925 

0-78 

0-550 

0-95 

0-067 

0-62 

0-911 

0-79 

0-521 

0-96 

0-047 

0-63 

0-896 

0-80 

0-492 

0  97 

U  Uza 

0-64 

0-880 

0-81 

0-463 

0-98 

0-014 

0-65 

0-862 

0-82 

0-433 

0-99 

0-004 

0-66 

0-843 

0-83 

0-403 

The  weight  of  an  n  which  is  <  0*5  is  the  same  as  the  weight 
of  an  n  which  exceeds  0-5  by  the  same  amount.  Thus  the 
weights  of  n  =  0*25  and  of  n  =  0-75  are  both  alike  =  0'634. 


Improved  Table  of  Weights*. 


n 

w 

n 

w 

n 

w 

0-50 

1-000 

0-67 

0-932 

0-84 

0-694 

0-51 

1-000 

0-68 

0-923 

0-85 

0-670 

0-52 

0-999 

0-69 

0-914 

0-86 

0-646 

0-53 

0-998 

0-70 

0-904 

0-87 

0-621 

0-54 

0-996 

0-71 

0-894 

0-88 

0595 

0-55 

0-995 

0-72 

0-883 

0-89 

0-567 

0-56 

0-992 

0-73 

0-871 

0-90 

0-538 

0-57 

0-989 

0-74 

0-859 

0-91 

0-506 

0-58 

0-985 

0-75 

0-846 

0-92 

0-472 

0-59 

0-981 

0-76 

0-832 

0-93 

0-435 

0-60 

0-977 

0-77 

0-818 

0-94 

0-396 

0-61 

0-972 

0-78 

0-803 

0-95 

0-352 

0-62 

0-967 

0-79 

0-787 

0-96 

0-304 

0-63 

0-960 

0-80 

0-770 

0-97 

0-249 

0-64 

0-954 

0-81 

0-752 

0-98 

0-187 

0-65 

0-947 

0-82 

0-733 

0-99 

0-112 

0-66 

0-940 

0-83 

0-713 

1-00 

0-000 

*  The  valnes  in  this  table  are  given  by  the  formula 

1  2^e-2«= 

w  —    

27r  n(l-n)' 
where  JV  =  no.  of  experimenta  made. 

The  table  is  quoted  from  F.  M.  Urban:  "The  Method  of  Constant  Stimuli 
and  Its  Generalizations,"  Pnycholofjical  Review,  Vol.  xvii.  1910,  p.  253.  See 
also  "Die  psychophysischen  Massmethoden  als  Grundlagen  empirischer 
Messungen,"  by  the  same  author,  Archiv  f.  d.  ges.  Psychologie,  Vols  xv 
and  XVI. 
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J:  rt3- 

queney 

x' 

1 

I?!****!  lion  /»  V 
JC  1  cljUcIiC^ 

X  x' 

Tr  t*on  n  on  \7 

Frequency 

y' 

X  J/' 

3-5 
18*5 
o 

-3 

_  2 
1 

—  I 

10-5 
37 

l~i  0 

31-5 
74 

irt  0 

6 
4 

o 
o 

-5-5 

—  3 

—  Z 

33 
b 

181-5 
36 

1  o 
J  z 

lo 

U 

— 

y 

—  1 

9 

9 

15-5 

1 

15-5 

15-5 

18 

0 

-60 

10-5 
2-5 

■2 
3 

21 
7  "5 

42 
22-5 

25 
21 

1 
2 

25 
42 

25 
84 

3 

4 

12 

48 

86=iV 

+  67 

347-5 

86  =  ^V 

+  56 

248 

+  7 

1 

-6 

(^i  =  -gg  =  - 0-06977, 


0-/  = 


O'l  = 


86     ^"^'^     12  ' 
2-7955, 
1-67. 


t?2  =  3g  =0-0814, 


347 


-o 


0-o'  = 


86 

=  4-0341, 
.-.  0-2  =  2-013. 


Sheppards  correction.  Kemember  that  a^=^„  andd  =  .\;  see  above  p.  39. 
t  Sheppard  s  correction  cannot  be  used  here,  since  the  units  of  the  sub- 
groups are  not  equal  and  there  is  not  high  contact  at  the  ends  of  the 
frequencies. 
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Frequencies 

x'y'* 

Total  of  fre- 
quencies {/) 

1  +  5-5 -4-5 -1-5 

2-5  +  0-5  +  2-5  +  3-4-5-5-2 

1  +  0-5+0-5-1 

0-5-1 

0-5 

1 

3 

-1 

1 

2 
3 
4 

5-5 

u 
8 
9 
11 

22 

0-5 

-3 
1 

2-5 
-0-5 

0-5 
1 
3 
-1 

+0-5 

-6 

3 

10 

2-75 

4 
9 
33 

22 

71-5-30-75 

>S'(;i;y)= 40-75 

S{xy)'- 

=  S{x'y') 

=  40-75  +  86  X  -0057 


=  41-24, 

 41-24 

~  86  X  1-67  X  2-013 

=  0-143, 

1 

p.E.  =  -67449  . 
=  0071. 

.'.  ^speed  of  addition  =  0-14  +  0-07. 

acc.  of  addition 

*  The  figures  in  italics  in  the  correlation  table. 

t   S{x'y')  =  S(x  +  di){y  +  d2) 

=  S  (xy)  +  dxS(y)  +  d2S(x)  +  Nd^di 

=  S  (xy)  +  Ndidi ,  since  S  (.t)  =  S(y)  =  0. 

.-.    S(xy)  =  S(x'y')-Ndid2. 
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•224 

•050176  X  3^5  = 

•175616 

-1 

•081 

M68561  X  18-5  = 

21-618379 

+ 

•488 

•238144  X  14^5  = 

3-453088 

+ 

•322 

•103684x18  = 

1^866312 

+ 

•145 

•021025  X  15^5  = 

•325888 

+ 

•490 

•2401     xl0-5  = 

2^52105 

+1 

•719 

2^954961  X  2-5  = 

7  •387403 

-1 

414 

1^999396x  3  = 

5^998188 

43-345924 

_  4:3-345924 
~  8G  X  4-034 

=  •1249, 

.-.  7/  =  0-353. 

P.E.  = -67449. 

=  0-064, 

%peed  of  addition  =  0"35  +  0'06. 
acc.  of  addition 
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237  "7  (mean  speed) 


a 


l40        183       220       260       300       340  380" 
Speed  (no.  of  figures  added  in  10  minutes) 

Fig.  16. 

Calculating  rj  from  the  means  of  the  y-arrays,  we  have 


19-68101 


~  86  X  2-796 
=  -0819. 
7?  =  0-29  ±  0  07. 

To  test  the  value  of  77  obtained  from  the  means  of  the 
a;-arrays,  for  linear  regression: 


•67449 


•323 


2  X  -07273 
=  2-22,  i.e.   <  2-5. 

Hence  regression  may  be  considered  to  be  linear. 
Regression  coefficients : 


6]o  =  r—  =  -118, 

0"2 


6,.  =  r-^=-l72. 


0"l 

*  See  p.  59. 
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Equations  to  regression  lines  are 

x-x  =  bu  {y  -  y), 

and  y-y  =  K('^- 

.'.  equation  to  regression  line  AB  is 

y-  164-2  =  -l72(a;-237-7X 

i.e.  y  = -1720; +123-316  (i). 

Similarly  equation  to  line  CD  is 

a; -237-7  =  118  (2/ -164-2), 

i.e.  it;  =  -118y  + 218-324  (ii). 

Equation  (i)  gives  the  most  probable  value  of  y  associated 
with  a  given  value  of  x,  with  a  standard  error 

o-^Vl-rS  i.e.  1-991. 
Similarly,  mtitatis  mutandis,  with  equation  (ii). 
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Example  of  Multiple  Correlation. 
GROUP  I.    {Boys,  ages  11— \2;  n  =  m.) 

1.  Crossing  through  two  letters  (e  and  r), 

2.  Crossing  through  four  letters  (a,  n,  o,  s). 

3.  Combination  test. 

4.  Mechanical  memory  test. 

Formula  for  multiple  correlation  : 

  ^12.34...Jl— 1       ^'l9t.34...?l— 1  '"2JJ.3J...71— 1 

'12.34...?l  —  "T  !• 

For  four  variables  this  becomes : 

_  M2.3       '14.3  '24.3 

'  12.  34  —  1  1  , 

(l-rVa)Ml-rV3)^ 

   ^'l2        ^"]3  ^23  * 

'12.3  —  T  r  • 

TABLE  1. 


Correlation  coefficient 

log  (1  -  »•-) 

12 

0-78 

T-59284 

13 

0-45 

1-90173 

14 

0-40 

1-92428 

23 

0-48 

1-88627 

24 

0-29 

1-96185 

34 

0-52 

1-86308 
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TABLE  11. 


Correlation 
coefficient 
(zero  order) 

Pi'oduct 
term  of 
numerator 

Numerator 

Correlation 
coefficient 
(first  order) 

log(l-r2) 

12 

0-78 

0-2160 

0-5640 

12-3 

0-7199 

T-68281 

13 

0-45 

0-3744 

0-0756 

13-2 

0-1377 

1 -99169 

23 

0-48 

0-3510 

0-1290 

23-1 

0-2308 

1-97623 

12 

0-78 

0-1160 

0-6640 

12-4 

0-7570 

T-63038 

14 

0-40 

0-2262 

0-1738 

14-2 

0-2902 

1-96179 

24 

0-29 

0-3120 

-0-0220 

24-1 

-0-0386 

1-99348 

13 

0-45 

0-2080 

0-2420 

13-4 

0-3091 

J.  yoooy 

14 

0-40 

0-2340 

01 660 

14-3 

0-2176 

T-97893 

34 

0-52 

0-1800 

0-3400 

34-1 

0-4154 

1-91774 

23 

0-48 

0-1508 

0-3292 

23-4 

0-4027 

T-92316 

24 

0-29 

0-2496 

00404 

24-3 

0-0539 

1-99873 

34 

0-52 

0-1392 

0-3808 

34-2 

0-4536 

1-89996 

TABLE  in. 


Correlation 
coefficient 
(first  order) 

Product 
term  of 
numerator 

Numerator 

Correlation 
coefficient 
(second  order) 

log(l-,2) 

12-  4 

13-  4 
23-4 

0-7570 
0-3091 
0-4027 

0-1245 
0-3048 
0-2340 

0-6325 
0-0043 
0-1687 

12-  34 

13-  24 
23-14 

0-727 
0007 
0-272 

T-67345 
1-99998 
1-96662 

12-3 
14-3 
24-3 

0-7199 
0-2176 
0-0539 

0-0117 
0-0388 
0-1567 

0-7082 
0-1788 
-0-1028 

12-34 
14-23 
24-13 

0-727 
0-258 
-0-152 

1-97009 
1-98985 

13-  2 

14-  2 
34-2 

0-1377 
0-2902 
0-4536 

0-1316 
0-0625 
0-0400 

0-0061 
0-2277 
0-41.36 

13-  24 

14-  23 
,34-12 

0-007 
0-258 
0-436 

T-90843 

23-  1 

24-  1 
34-1 

0-2308 
-0-0386 
0-4154 

-  0-0160 
0-0959 

-  0-0089 

0-2468 
-0-1-345 
0-4243 

23-  14 

24-  13 
34-12 

0-272 
-0-152 
0-436 

The  regression  equation  between  changes  in  intelligence 
(as  measured  by  the  combination  test)  and  changes  in  the  other 
three  variables  is 

^3  =  K .  24  «1  +  K.  14  00.,  +  634. 12  «4 . 


APPENDICES 

Calculation  of  regression  coefficients. 

0-1  =  68-19,    0-0  =  58-43,    0-3  =  16-34,    0-4  =  9-70, 
<r\.^...n  =  err  (1  -  rV)  (1  -,-3.,)  (1  -  ...  (1  -  r^,,,...,.-,)- 

0-3.  24  =  0-3  (1-^^32)^(1-^4. 

=  12-77. 

Similarly  0-1.04  =  40-83,  0-3.14=13-27,  o-2.j4  =  36-27,  (73.12  =  14-28, 
0-4.12  =  8-82. 

Again,  612.34...^  =  ^10.34...,,  ^ii^*^ . 

^2.3i...n 

•••&3x.24  =  r3i.,4^=-002. 

0'l.24 

Similarly,  633.14  =  -099,  634.12=  '703. 

Hence  regression  equation  is 

X3  =  002x1  +  099x2  +  "703x4. 

The  standard  error  (0-3.124)  made  in  estimating  from  ocjjX^, 
and     by  this  equation 

=  (73  (1  -  7'\)*  (1  -  r=32.i)*  (1  -  ^4.12)* 
=  12-78. 


APPENDIX  IV 

Regression  Carves  for  Speed  and  Accuracxj  of  Addition. 

GROUP  II. 

— >-  speed 

125      160      195      230      265  300 


200 


7?  =  0-524  ±0-079, 
r  =  0-24 +  0-101, 


■67449 


=  2-16. 


GROUP  III. 


140 


150  • 


160  ■ 


I  70 


o 
OS 


180  - 


190 


200 


V  =  0-38  +  0-09, 
r  =  0-33  +  009. 


10 


146 
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200 


GROUP  IV. 


speed 

56  68 


7)  =  0-60  ±  0  06, 
r  =  0-43  ±0-08, 


•67449 


^  Viy^  -    =  2-28. 


100 


GROUP  V  (a)  {enlarged,  n  =  51). 


,,  =  0-62  +  0  06, 
r  =  0-38 +  010, 


•67449 
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GROUP  VI  (additional  grotip,  40  hoys,  ages  11—12). 

— speed 


122 


135 


146 


161 


o 
u 


174 


187 


200 


85       110      135      160  ^ 


185      910     235  260 


-1  r 


77  =  0-38  ±0  09, 
r  =  0-09  +  0-10, 

iV^^^^^  =  l'7 


Regression  Curve  for  Combination  and.  Mechanical  Memory, 

GROUP  I. 


->-  combination 


1  3        26       39        52       65  78 


7?  =  0  G3  ±  0  05, 
r  =  0-52  +0-0G, 


•67449 


^\/972_r»  =  2-13. 
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"  Difference-hypothesis  "  of  Fechner,  6 
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Discrimination,  10,  34  ;  tests  for  mea- 
suring sensory  discrimination,  87  ff. 
Drawing,  relation  to  other  mental 
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Environment  producing  correlation, 
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Errors  of  observation,  43,  83  £f. 
Euler's  circles,  52 
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with  intelligence,  126 

Mental  tests,  11,  17,  81  ff.,  99  ff. 

Method,  of  Limits,  13  ff.,  33,  34;  of 
Average  Error,  17,  18  ;  of  Constant 
Stimuli,  18  ff.  ;  of  Serial  Groups, 
16,  17  ;  of  Concomitant  Variations, 
42,  132 ;  of  Least  Squares,  30,  55 

Mode,  21,  22,  39 

Moment-coefBcient  or  moment,  37  ff. 
"  Motor"  test,  102,  103 
Miiller-Lyer  Illusion,  105,  106 
Multiple    correlation,    51-53,   63  ff., 
128  ff'. 

"Normal  "  correlation,  128 ff. 
"  Normal "  frequency-distribution,  see 
Gaussian 

Observational  errors,  elimination  of, 
83  ff. 

"Partial"  or  multiple  correlation,  51- 
58,  63 ff.,  121,  124;  probable  error 
of,  65 

Poetry,  memory  for,  109 
"  Point "  binomial,  37 
Practice,  error  due  to  inequalities  of, 
34 

Precision,  measure  of  (h),  26 

Probability  curve,  26  ff. 

Probable  error  (p.e.),  14,  22,  27,  49, 

59  ff.,  73 
Product-moment  formula,  46 


' '  Quotient-hypothesis  "  of  Plateau,  6-8 

Random  sample,  20,  35,  59 
Ranks,  correlation  of,  11,  48,  49,  50, 
71  ff. 

Ratio,  see  Correlation  ratio 

Regression,  46 ;  line,  47,  56,  57  ; 
curve,  55  ;  examples  of,  145-147 ; 
coefficients,  56,  57,  63 ;  criterion  for 
linearity  of,  59 

Reliability,  measures  of,  14,  23  ;  co- 
efficient of,  83 

"  Scatter,"  measures  of,  14,  22,  23,  26 
"  Scatter-diagram,"  54 
Sensation,  see  Intensity 
Sense-distance,  1,  2,  3,  5 
Sexual  mental  differences,  111,  121, 
126,  127 

"  Side-comparisons,"  error  due  to,  34 
Skewness  of  distribution,  22,  39 
Space  error,  15-17 

Spurious  correlation,  76,  77,  113,  120 
Standard  deviation  (a),  22,  26,  27,  39 
Stimulus,  measurement  of  as  an  aid 
to  mental  measurement,  2,  11 

Threshold,  see  Limen 

Time  error,  15-17 

"  Trisection  of  lines"  test,  104,  105 

Two-row  Table,  67  ff. 

Unit  of  measurement,  1 ;  unit  sense- 
distance,  2 ;  j.  n.  d.  as  unit,  4 

Variability,  23,  44,  84,  86  ;  coefBcient, 

86,  125 
Variation,  coefficient  of,  61 
Vertical-Horizontal  Illusion,  106,  107, 

126 

Weber's  Law,  3,  4 ;  interpretations  of, 
8-10 

Weights,  Table  of  CoefiScients  of,  30, 
135 

Weidon's  Experiment,  78,  79 


CAMBRIUOB  :  PnlNTKU  BY  JOHN  CL.VY.  M.A.  AT  THE  UNIVKR8ITV 


